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Title: Use of mvirine genomic regions identified to be involved in tumor 



developm ent for the development of anti-cancer drugs and diagnosis 
of cancer 



This application is a continuation-in-part of U.S. Patent Application Serial No. 
10/252,132 filed September 19, 2002. 

FIELD OF THE INVENTION 
5 The present invention relates to muidne genomic regions, identified by 

retroviral insertional tagging of mice as being involved in tumor development, 
in particular leukemia development, as well as himian'homologues thereof, 
and to the use of these genomic regions for the identification and development 
of anti-cancer drugs, such as small molecule inhibitors, antibodies, ribozymes, 

10 antisense molecules and RNA interference (RNAi) molecules, that are effective 
in reducing or eUminating the tumorigenic effects of genetic transformations in 
these genomic regions and/or eliminating the tvimorigenic effects of expression 
products thereof. The invention further relates to these anti-cancer drugs and 
to their use as pharmaceutical reagents for the treatment of cancer, as well as 

15 to pharmaceutical compositions comprising one or more of said pharmaceutical 
reagents and to methods for the treatment of cancer using said pharmaceutical 
compositions, in partictdar to methods of gene therapy. In yet further aspects, 
the invention relates to nucleic acids derived from said murine genomic 
regions involved in tumor development or fragments thereof, to antibodies 

20 raised against the expression products of genes the sequence of which is 

comprised in said mimne genomic regions or human homologues thereof, to 
antibodies raised against the expression products of genes affected by genetic 
transformations in said murine genomic regions or human homologues thereof^ 
to the use of said nucleic acids or antibodies as diagnostic reagents for the 

25 diagnosis of cancer, as well as to diagnostic compositions comprising one or 
more of said diagnostic reagents and to methods for the diagnosis of cancer 
using said diagnostic compositions. 
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BACKGROUND OF THE INVENTION 

After a quarter century of rapid advances, cancer research has 
generated a rich and complex body of knowledge reveeding cancer to be a 
5 disease involving dynamic changes in the genome. Cancer is thought to result 
from six essential alterations in cell physiology that collectively dictate 
mahgnant growth: self-suf&ciency in growth signals, insensitivity to growth- 
inhibitory (anti-growth) signals, evasion of programmed ceU death (apoptosis), 
limitless replicative potential, sustained angiogenesis, and tissue invasion and 
10 metastasis. 

In general, these essential alterations are the result of mutations in 
genes involved in the control of these cellular processes. These mutations 
include deletions, point mutations, inversions and amplifications. The 
mutations residt in either an aberrant level, timing, and/or location of 

15 expression of the encoded protein or a change in function of the encoded 
protein. These alterations can affect cell physiology either directly, or 
indirectly, for example via signaling cascades. 

The possibilities to identify genes that promote the transition firom a 
normal cell into a malignant cell is an essential prerequisite for the 

20 development of novel therapies for the treatment of cancer. 

One of the most common therapies for the treatment of cancer is 
chemotherapy. Herein, the patient is treated with one or more drugs that 
function as inhibitors of cellular growth and are therefore intrinsically toxic. 
Since cancer cells are among the fastest growing cells in the body, these cells 

25 £u:e severely affected by the applied drugs. However, normal cells are also 

affected which results - besides toxicity - in very severe side-effects such as loss 
of fertility. 

Another commonly used therapy to treat cancer is radiation therapy or 
radiotherapy. Radiotherapy uses high energy rays to damage cancer cells 
30 resulting in the induction of cell cycle arrest. Cell cycle arrest will ultimately 
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result in programmed cell death (apoptosis). However, also normal cells are 
irradiated and damaged. In addition, it is difi&cnlt to completely obliterate all 
tumor cells using this therapy. Importantly, very small tumors and developing 
metastases cannot be treated by radiotherapy. Moreover, irradiation can cause 
5 mutations in the cells surroxmding the tumor which increases the risk of 
developing new tumors. Combinations of both chemo- and radiotherapy are 
also frequently used and a subsequent accumulation of side-effects is observed. 

The major disadvantage of both therapies is that they do not 
discriminate between normal and tumor cells. Furthermore, tumor cells have 
10 the tendency to become resistant to these therapies, especially to 
chemotherapy. 

Therapies directed at tumor-specific targets would increase the 
efficiency of the therapy due to i) a decrease in the chance of developing drug 
resistance, ii) a reduced toxicity since the drugs used in these tumor-specific 

15 therapies are appUed at much lower concentrations and iii) observed side- 
effects are reduced since only tumor cells are affected. 

The use of tumor-specific therapies is limited by the number of targets 
known. Since tumors may arise firom a large number of different and discrete 
changes in the genome, the genotype between tumor cells may vary 

20 considerably. The disease type that they cause may however still be classified 
as the same. This is one of the main reasons why not a single therapy exists 
that is effective in all patients with a certain type of cancer. 

Pharmacogenetics and pharmacogenomics aim at determining the 
genetic determinants linked to diseases. Most of the disease are multigenic 

25 diseases, and the identification of the genes involved therein should allow for 
the discovery of new targets and the development of new drugs. 
Pharmacogenomics also encompasses the use of specific medications according 
to the genotype of the patient. This should lead to a dramatic improvement of 
the efficiency of the drugs. 
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Many physiological diseases are targeted by this novel pharmaceutical 
approach. One can name the autoimmune and inflammatory diseases, for 
example Addison's Disease, Alopecia Areata, Ankylosing Spondylitis, Behcet's 
Disease, Chronic Fatigue Syndrome, Crohn's Disease, Ulcerative Colitis, 
5 Inflammatory Bowel Disease, Diabetes and Multiple Sclerosis. 

As stated, cancers are also beheved to be multigenic diseases. Some 
oncogenes (for exemple ras, c-myc) and txmior suppressor genes (for example 
p53) have previously been identified, as well as some genetic markers for 
predisposition (for example the genes BRCAl and BRCA2 for breast cancer). 
10 The identification of new genes involved in other kinds of cancer should allow 
for a better information of the patient and the prevention of the development 
of the disease itself, an improved life expectancy as aheady observed with 
breast cancer (Schrag et al. 2000. JAMA 283: 617-24). 

Knowledge of the identity of genes involved in cancer development 
15 therefore greatly facilitates the development of prophylactic, therapeutic and 
diagnostic methods for this disease. Diagnosis of the affected genes in a certain 
tumor type allows for the design of therapies comprising the use of specific 
anti-cancer drugs, for instance, drugs directed against the proteins encoded by 
these genes. 

20 Presently, only a limited number of genes involved in tumor 

development are known and there is a clear need for the identification of novel 
genes involved in tumor development to be used in the design of tumor-specific 
therapies and to define the genot3T)e of a certain tumor. 

In the research that led to the present invention, a number of genomic 

25 regions were identified to be involved in tumor development by proviral 

tagging. Proviral tagging (Berns. 1988. Arch Virol.l02:l-18; Kim et ah 2003. J 
Virol. 77:2056-62) is a method that uses a retrovirus to infect normal 
vertebrate cells. After infection, the virus integrates into the genome thereby 
disrupting the local organization of the genome. This integration is random 

30 and affects the expression or function of genes, depending on the integration 
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site of the virus, which may for instance be in a coding region, a regulatory- 
region or a region nearby a gene. If a cellular gene involved in tumor 
development is affected, the ceU will acqxiire a selective advantage to develop 
into a tumor as compared to cells in which no genes involved in tumor 
5 development are affected. As a result, all cells within the tumor originating 
from the cell affected in a gene involved in tumor development wiU carry the 
same proviral integration. Through analysis of the region nearby the retroviral 
integration site, the affected gene can be identified. Due to the size of the 
genome and the total number of integration sites investigated in the present 
10 invention, a gene that is affected in two or more independent tumors 

investigated must provide a selective advantage and therefore contribute to 
tumor development. Such sites of integration are designated as common 
integration sites (CIS). 

By using an improved method of proviral tagging the present inventors 
15 were able to identijfy a large nxmiber of murine genomic regions involved in the 
development of myeloid leukemia in mice. AU such genomic regions comprised 
common integration sites of the virus nucleic acid and such sites were either 
found inside, before or after genomic sequences that coded for known or 
putative murine genes, or in yet unknown but defibaed positions within the 
20 murine genome. Many of the murine genomic regions, as defined by the 

presence therein of a common integration site, have so far not been reported to 
be involved in tumor development. Novel cancer related murine genomic 
regions identified in this manuaer are the following: Adamll, AI462175, Cd24a, 
Edg3, Itgp, Kcnjie, KcnkS, Kcnn4, Ltb, Lyl08, LySi, mouse homologue of 
25 EMILIN, Mrcl, Ninj2, Nphsl, Sema4b, Tm9sf2, and Tnfrsfl?, encoding ceU 
surface proteins; Apohec2, Btd, Cds2, Clpx, Ddxl9, Ddx21, Dnmt2, Dqxl, 
Hdac7a, Lce-pending, Mgatl, mouse homologue of CILP, mouse homologue of 
NOH61, Nudel-pending, Pah, Pdil, Ppia, Prpsl, Ptgds, and Vars2, encoding 
enzymes; Dagk4, mouse homologue of PSK, Nme2, Snfllk and Tyki, encoding 
30 kinases; DusplO, Inpp4a and InppSb, encoding phosphatases; 1116, Prg, and 
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Scya4, encoding secreted factors; Akap7, ApiS, Arfrpl, Arhgapl4-pending, 
Cish2, Dappl, Fabp6, Fkbp8, Flizl-pending, Hint, lerS, Jundp2-pending, 
Lmo6, Midi, mouse homologue olAKAPlS, mouse homologue of BIN2, mouse 
homologue of CEZANNE, mouse homologue of CHD2, mouse homologue of 
5 MBLL, mouse homologue of SLC16A10, mouse homologue of SLC16A6, mouse 
homologue of SLCl 7A5, mouse homologue of TAFSL, mouse homologue of 
UlSNRNPBP, mouse homologue of ZNF8, Mtap7, Myolc, Nkx2-3, Nsf, Pcdh9, 
Pkig, Prdx2, Pscdl, Psmbl, Psmel, Psme2, Rgll, Ril-pending, Saxl, Slcl4a2, 
SlcTal, SlcTall, Swap70, Txnip, and Ubl3, encoding signaling proteins; Clic3, 
10 Gtll-13, mouse homologue of NOLSA, and Vdac2, encoding structural proteins; 
ABTl-pending, Cthpl, Dermal, Ebf, Elf 4, Ldbl, mouse homologue of NRIDI, 
mouse homologue of ZER6, Rest, Tbp, Zfp238, Zfp287, and Zfp319, encoding 
proteins involved in transcriptional regvdation; Lrrc2, Satbl, Slfn4, and 
genomic regions with the following Celera identification codes mCG10290, 
15 mCGlOeiS, mCG11234, mCG11325, mCG11355, mCGllSOS, mCGll817, 
mCG12566, mCG12630, mCG12824, mCG13346, mGGl4143, mCG14155, 
mCG14342, mCG15141, mCG15321„ mCG16761, mCG16858, mCG17127, 
mCG17140, mCG17142, mCG17547, mCG17569, mCG17751, mCGl7799, 
mCGl7802, mCG17918, mCGl8034„ mCG1850. mCGl8663, mCG18737, 
20 mCG20276, mCG20905, mCG20994, mCG21403, mCG21505, mCG21529, 
mCG21530, mCG21803, mCG22014, mCG22045, mCG22386, mCG2258, 
mCG22772. mCG23032, mGG23035, mCG23069, mCG23075, mCG23120, 
mCG2543. mCG2824, mCG2947, mCG3038, mCG3729, mCG3760, mCG50409, 
mCG50651, mCG5068, mCG5070, mCG51393. mCG52252, mCG52498, 
25 mCG53009, mCG53724, mCG55023, mCG55075, mCG55198, mCG55265, 
mCG55512, mCG56069, mCG56089, mCG56746, mCG57132, mCG57265, 
mCG57617, mCG57827, mCG58254, mCG58345, mCG5900, mCG5905, 
mCG59368, mCG59375, mCG59533, mCG59662, mCG59810, mCG59997, 
mCG60526, mCG60833, mCG61221, mCG61661, mGG61897, mCG61907, 
30 mCG61943, mCG62177, mCG62971, mCG63537, mCG63601, mCG64346, 



wo 2004/016317 ^^PCT/NL2003/000583 



mCG64382, mCG64398, mCG65022, mCG65585, inCG65785, inCG66128, 
mCG66379, mCG66776, mCG66965, mCG7831, mCG7856, mCG8424, 
mCG9002, mCG9537, mCG9538, mCG9791, mCG9792, mCG9843, mCG9875, 
mCG9877, and mCG9880. These genes are also listed in Table 1 below. 

5 

SUMMARY OF THE INVENTION 

A first object of the present invention is to provide novel genes involved 
in t\imor development for use in the design of tumor-specific therapies. This 
object is, in the case of therapies for himians, achieved by using the human 

10 homologues of the murine genomic regions hsted in Table 1 for the 

development of inhibitors directed against the genes encoded or affected by 
these regions that are involved in tumor development and/or their e23)ression 
products and by using these inhibitors for the preparation of pharmaceutical 
cpmpositions for the treatment of cancer, preferably for the treatment of 

15 letikemia. 

These murine genomic regions were discovered by using an improved 
method of proviral tagging involving a nested PGR approach as a result of 
which the cloning step as conventionally performed may now be omitted, 
resulting in a speeded up process of gene identification. Further, the improved 

20 method resulted in the identification of essentially only myeloid lexikemia- 

related genes since the mice essentially only developed myeloid leukemia. Also 
the method supported the use of various murine leukemia viruses, which use 
surprisingly resulted in the identification of partly overlapping but otherwise 
essentially complementary sets of murine genomic regions involved in the 

25 development of murine myeloid leukemia. This then expands the possibilities 
of finding additional myeloid leukemia-related genes. 

Another improvement of the method of the present invention for the 
identification of genomic regions involved in the development of cancer, 
preferably of mmrine leukemia, resides in the use of an improved method for 

30 determining integration sites as being common. This method also involves a 
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powerful amplification procedure including nested PCR which is combined 
with a nucleic acid blotting proced\ire, preferably Southern, as illustrated in 
Figure 2, 

The method of the invention for the identification of genomic regions 
5 involved in the development of cancer comprises a first step of performing 
retroviral insertional mutagenesis of a subject, comprising infecting said 
subject with a tiimor inducing retrovirus. In a preferred embodiment, said 
subject is a mouse, preferably said mouse is an NIH-Swiss mouse or an FVB/N 
mouse and said retrovirus is Graff! mxirine 1.4 leukemia virus (Gr-1.4) and/or 

10 Cas-BR-M murine leukemia virus (Cas-BR-M MuLV). The method of the 
invention comprises a subsequent step of isolating chromosomal DNA from 
primary timiors developed in the infected subject, such as present in e.g. 
spleen, liver, thymus, and lymph node tissue of mice with a virus-induced 
leukemia. The method of the invention comprises a subsequent step of 

15 digesting said chromosomal DNA with a restriction enzyme capable of cutting 
at least once in the genome of said tumor inducing retrovirus and at least once 
in the chromosomal DNA of said subject. Preferably, said chromosomal DNA is 
digested with a restriction enzyme recognizing restriction sites located at 
known positions within the virus LTR sequence but which will be unique 

20 within the genomic sequence in each virus insertion. Preferably, the restriction 
sites are separated such that a region of the chromosomal DNA of the subject 
flanking the known viral sequence can be identified based on the nucleotide 
sequence of said chromosomal DNA region, i.e. said region comprising a 
sufficient number of contiguous nucleotides to allow comparison thereof with a 

25 database of known sequences. 

The method of the invention comprises a subsequent step of ligating the 
digested DNA to circular DNA, followed by amplification of the chromosomal 
DNA firagment flanking the known viral sequence. In a method of the present 
invention, PCR amplification is carried out by performing a first PCR reaction 

30 with said circizlar DNA using a first set of retrovirus (LTR; long terminal 
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repeat)-specific primers followed by performing a second nested PGR with the 
product of said first PGR reaction using a second set of retrovirus (LTR)- 
specific primers. 

Following the step of amplifying the chromosomal DNA fragment 
5 flanking the known viral sequence, the nucleotide sequence of said 

chromosomal DNA fragment is determined, optionally via a cloning step, but 
preferably by performing a direct sequencing reaction, the latter being possible 
owing to the powerful and specific amplification procedures used. The 
determined nucleotide sequence of said chromosomal DNA fragment is then 

10 compared to a database of known sequences in order to identify the subject's 
genomic region wherein the retrovirus was integrated thereby indicating the 
genomic region potentially involved in the development of cancer. 

In order to determine whether an integration site is a common 
integration site, the method of the invention for the identification of genomic 

15 regions involved in the development of cancer comprises a step of designing 
genomic region-specific amplification primers, preferably in a nested format 
and said primers being capable of hybridizing to the chromosomal DNA 
fragment flanking the known viral sequence, the sequence of which fragment 
was determined as described hereinabove and which fragment represents a 

20 genomic region comprising a virus integration site. Then, RNA or DNA is 

isolated from tumors to be analysed for the presence of a common integration 
site and an amplification reaction, preferably a nested (RT-)PGR reaction, is 
performed with one or more of the genomic region-specific piimers (also 
termed locus specific primers herein) and one or more of the virus-specific 

25 primers, preferably retrovirus (LTR)-specific primers. Next, the amphfication 
products are blotted and the blot is separately hybridized with a virus-specific 
probe and a genomic region-specific probe. Amplification products hybridizing 
with both probes are considered to represent common integration sites, such 
common integration sites indicating the genomic region involved in the 

30 development of cancer. 
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It is an aspect of the present invention to use the murine genomic 
regions of the present invention for the development of inhibitors directed 
against the genes encoded or affected by these genomic regions 8uid/or their 
expression products. 
5 In one embodiment of the present invention, the inhibitors are 

antibodies and/or antibody derivatives directed against the expression 
products of genes encoded by the genomic regions or affected by genetic 
transformations in the genomic regions Usted in Table 1. Therapeutic 
antibodies are for instance useful against gene expression products located on 
10 the cellular membrane and can be comprised in a pharmaceutical composition. 
Also, antibodies may be targeted to intracellular, e.g. cytoplasmic, gene 
products such as RNA's, polj^eptides or enzymes, in order to modulate the 
activity of these products. Preferably, such antibodies are in the form of 
intrabodies, produced inside a cancer cell. In addition, antibodies may be used 
16 for dehverance of at least one toxic compound linked thereto to a timior ceU. 

In a preferred embodiment of the presient invention, the inhibitor is a 
small molecule capable of modulating the activity or interfering with the 
function of the protein expression product of the genes encoded by the genomic 
regions or affected by genetic transformations in the genomic regions involved 
20 in timior development as defined herein. In addition, small molecules can also 
be used for dehverance of at least one hnked toxic compoimd to a tumor cell. 

On a different level of inhibition, nucleic acids can be used to block the 
production of proteins by destroying the mRNA transcribed from respective 
gene encoded by the genomic regions or affected by genetic transformations in 
25 the genomic regions. This can be achieved by antisense drugs, ribozymes or by 
RNA interference (RNAi). By acting at this early stage in the disease process, 
these drugs prevent the production of a disease-causing protein. The present 
invention relates to antisense drugs, such as antisense RNA and antisense 
ohgodeoxynucleotides, ribozjones and RNAi molecules, directed against the 



wo 2004/016317 



•CT/NL2003/000583 



11 

genes encoded by the genomic regions or affected by genetic transformations in 
the genomic regions listed ia Table 1 or transcription products thereof. 

The expression level of a gene can either be decreased or increased 
during tumor development. Naturally, inhibitors are used when the expression 

5 levels are elevated. However, the present invention also provides for 

"enhancers", to boost the expression level of a gene encoded by the genomic 
regions or aJBfected by genetic transformations in the genomic regions involved 
in tximor development and of which the expression levels are reduced. 
"Enhancers" may be any chemical or biological compound known or found to 

0 increase the expression level of genes, to improve the function of an expression 
product of a gene or to improve or restore the expression of a dysfunctional 
gene. 

Very suitable therapies to overcome reduced expression levels of a gene 
or to restore the expression of a dysfunctional gene encoded by the genomic 

5 regions or affected by genetic transformations in the genomic regions as 

disclosed herein include the replacement by gene therapy of the dysfunctional 
or affected gene or its regulatory sequences that drive the expression of said 
gene. The invention therefore relates further to gene therapy, in which a 
dysfunctional gene of a subject encoded by the genomic regions or affected by 

!0 genetic transformations in the genomic regions as hsted in Table 1 or a human 
homologue thereof or a dysfunctional regulatory sequence of a gene of a subject 
encoded by the genomic regions or affected by genetic transformations in the 
genomic regions as disclosed hsted in Table 1 or a human homologue thereof is 
replaced by a functional coxmterpart, e.g. by stable integration offer instance a 

!5 lentiviral vector comprising a functional gene or regulatory sequence into the 
genome of a subject's host cell which is a progenitor ceU of the txmior cell-line 
of the subject and grafting of said transfected host cell into said subject. 

The invention also relates to forms of gene therapy, in which the genes 
encoded by the genomic regions or affected by genetic transformations in the 

50 genomic regions Hsted in Table 1 or a human homologue thereof are i.a. used 
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for the design of dominant-negative forms of these genes which inhibit the 
function of their wild-tj^e coimterparts following their directed expression 
from a suitable vector in a cancer cell. 

Another object of the present invention is to provide a pharmaceutical 
5 composition for the treatment of cancer comprising one or more of the 
inhibitors, "enhancers", replacement compounds, vectors or host cells 
according to the present invention as a pharmaceutical reagent or active 
ingredient. The composition can further comprise at least one pharmaceutical 
acceptable additive like for example a carrier, an emulsi&er, or a conservative. 
10 In addition, it is the object of the present invention to provide a method 

for treatment of subjects suffering from cancer which method comprises the 
administration of the pharmaceutical composition according to the invention to 
patients in need thereof in a therapeutically effective amoxmt. 

A further aspect of the present invention relates to the use of the 
15 murine genomic regions or as disclosed herein as well as to human homologues 
thereof or their gene expression products for the development of reagents for 
the diagnosis of cancers. 

The invention also provides diagnostic compositions for diagnosing 
cancer, comprising the diagnostic reagents such as specific nucleic acid probes 
20 and/or specific antibodies capable of specifically binding to the mxirine genomic 
regions as disclosed herein as well as to human homologues thereof, to the 
transcription products of genes encoded thereby and/or to the expression 
products of such genes, respectively. 

Additionally, the present invention relates to methods for diagnosing 
25 cancer, preferably leukemia, by using a diagnostic composition of the present 
invention. 

In preferred aspects and embodiments of the present invention use is 
made of at least 2, preferably at least 4, more preferably at least 10, even more 
preferably at least 30 and most preferably all miirine genomic regions selected 
30 from the group consisting of Adamll, Akap7, Arpgapl4 , Bomb, Cd24a, Cish2, 
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Cig5, Clic3, Cra, Dermol, EMILIN, Flj20489, GalntS, Hook, Ier5, IL16, Iprgl, 
Itgp, KcnkS, lrrc2, Ltb, Mbll, Mrcl, Mtap7, Nmj2, Nrldl, PcdliQ, Prdx2, Prpsl, 
Pdil, PtgdS, Rgll, Sardh, Scya4, Slcl6A6, SwapTO, Txnip, Trim46, Tnfrsfl? 
and Ubl3 listed in Table 1. Using these genes very advantageous expression 
5 analysis in acute myeloid leukemia (AML) patients and/or diagnostic methods 
of leukemia may be performed as each of these genes appeared selectively up 
or down regulated (on the mRNA level) in one or more specific subgroups of 
AML. 

Particularly preferred is the use in aspects and embodiments of the present 
10 invention of at least 2, preferably at least 3, more preferably at least 4, and 

most preferably all genes of the group of murine genomic regions consisting of 

Cd24a, Cish2, Cra, Ltb and Prdx2 Hsted in Table 1. 

In addition to the newly foxmd genomic regions and genes afiected by 

transformations in such regions andof which the involvement in the 
15 development in tumor was hitherto unknown, use in aspects and embodiments 

of the present invention may also be made of gene sequences known to be 

involved in cancer development, such as for instance p53, Notch-1, Evi-l, NFl 

(Evi-2), Lck'l, Pim-1, HoxA9 (EviS), FlUl, Yyl, Pps, Ptpnl and N-Myc. 

20 DEFINITIONS 

The term "murine genomic region" or "genomic region" as used herein 
indicates the site of integration of the nucleic acid of the virus used in proviral 
tagging of (mouse) genomic DNA. Such a viral integration site is involved in 
development of cancer when it is identified as a common integration site and 

25 may be located inside a known or putative gene, nearby, e.g. before or after a 
known or putative gene such as in a regulatory sequence thereof, or in a yet 
unknown but defined position within the (murine) genome. As such, a "murine 
genomic region" herein indicates a gene, a putative gene, a region nearby a 
gene or putative gene, or a region of the murine genome the function of which 

30 is hitherto \mknown. Genes comprised in such genomic regions or genes of 
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which the expression is afifected by genetic transformations in such genomic 
regions are believed to be involved in cancer development as are the nucleotide 
sequences of the regions themselves and the expression products thereof 
"Expression products" may be protein and/or RNA. 

5 A "murine genomic region-related gene" as used herein is a gene which 

is fully or partially comprised in a murine genomic region of the present 
mvention or a gene affected by transformations therein, i.e. in general; a gene 
which function or expression is affected by a transformation in a murine 
genomic region of the invention. 
10 The term "target cell" as used herein indicates a eukaryotic cell, 

preferably an animal (including human), more preferably a mammahan cell, 
including a specialized tissue cell and progenitor cell thereof, which is capable 
of being infected or transfected by vector of the invention, or which is the 
target for a diagnostic reagent of the present invention. 

15 The term "producer cell" as \ised herein refers to a cell or cell-line, 

respectively, suitable for replication, propagation, and/or production, of a 
vector of the invention. 

The term "host cell" as used herein generally indicates a cell used for the 
expression of a viral genome, or propagation of a vector or virus and, in the 

20 context of the present invention, includes both the target and producer cell. 

The term "viral vector" refers to a nucleic acid construct comprising a 
viral genome capable of being transcribed in a host cell, which genome 
comprises sufficient viral genetic information to allow packaging of the viral 
RNA genome, in the presence of packaging components, into a viral particle 

25 capable of infecting a target cell. Infection of the target ceU includes reverse 
transcription and integration into the target cell genome, where appropriate 
for particular viruses. 

The term "isolated nucleic acid sequence" means a nucleic acid sequence 
that is free of the nucleotide sequences that flank the nucleic acid sequence in 

30 the naturally-occurring genome of the organism from which it is derived. The 
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term therefore includes, for example, any recombinant DNA which is 
incorporated into a vector; into an autonomously repHcating plasmid or into 
the genomic DNA of a prokaryote or eukaryote from which it has not been 
derived; or which exists as a separate molecule (e.g., an RNA, a cDNA or a 

5 genomic or cDNA fragment produced by PGR or restriction endonuclease 

digestion) independent of other sequences. It also includes a recombinant DNA 
which is part of a hybrid gene encoding additional poljrpeptide sequences. 

As used herein, "polynucleotide", "nucleic acid" or "oligonucleotide" 
includes reference to a deoxyxibonucleotide or ribonucleotide polymer in either 

10 single-or double-stranded form, and unless otherwise limited, encompasses 
known analogues of natural nucleotides that hybridize to nucleic acids in a 
manner similar to naturally occurring nucleotides. In specific embodiments, 
the "polynucleotide", "nucleic acid" or "oligonucleotide" can be substituted by 
chemical substances that can form sequence -specific interactions similar as for 

15 the natural phosphodiester "nucleic acid". Ejiown and preferred analogues 
include polymers of nucleotides with phosphorothioate or methylphosphonate 
Uaisons, or peptide nucleic acids. Unless otherwise indicated, a particular 
nucleic acid sequence includes the complementary sequence thereof. 

A coding sequence is "under the control" of regulatory sequences in a cell 

20 when RNA polymerase transcribes the coding sequence into mRNA encoded by 
the coding sequence. 

The term "coding sequence" as defined herein is a sequence which is 
transcribed into mRNA and optionally translated into a polypeptide when 
placed imder the control regulatory sequences. The boundaries of the coding 

25 sequence are generally determined by a translation start codon ATG at the 5'- 
terminus and a translation stop codon at the 3'-terminus. A coding sequence 
can include, but is not hmited to, RNA, DNA, cDNA, and recombinant nucleic 
acid sequences. 

The term "regtdatory sequences" is defined herein to include aU 
30 components which are necessary or advantageous for expression of a coding 
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sequence. Bach regxilatory sequence may be native or foreign to the coding 
sequence. Such regulatory sequences include, but are not limited to, a leader, a 
polyadenylation sequence, a propeptide sequence, a promoter, a signed 
sequence, and a transcription terminator. At a minimum, the regulatory 
5 sequences include a promoter, and transcriptionail and translational stop 
signals. The regulatory sequences may be provided with linkers for the 
purpose of introduciag specific restriction sites facilitating ligation of the 
regulatory sequences with the coding region of the nucleic acid sequence 
encoding a polypeptide. 

10 The term "promoter" is used herein for its art-recognized meaning to 

denote a portion of a gene containing DNA sequences that provide for the 
binding of RNA polymerase and initiation of transcription. Promoter sequences 
are commonly, but not always, found in the 5' non-coding regions of genes. 
The term "packaging components" refers to building blocks which are 

15 necessary for encapsulation of nucleic acid of a virus into mature viral 
particles. 

The terms "nucleotide sequence homology" or "sequence homology" or 
"homologous sequence" as used herein denote the presence of homology 
between two or more polynucleotides. Polynucleotides have "homologous" 

20 sequences if the sequence of nucleotides in their sequences is the same when 
ahgned for maximtim correspondence. Sequence comparison between two or 
more poljmucleotides is generally performed by comparing portions of two 
sequences over a comparison window to identify and compare local regions of 
sequence similarity. The comparison window is generally from about 20 to 200 

25 contiguous nucleotides. The "percentage of sequence homology" for 

polynucleotides, such as 50, 60, 70, 80, 90, 95, 98, 99 or 100 percent sequence 
homology may be determined by comparing two optimally ahgned sequences 
over a comparison window, wherein the portion of the polynucleotide sequence 
in the comparison window may include additions or deletions (i.e. gaps) as 

30 compared to the reference sequence (which does not comprise additions or 
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deletions) for optimal alignment of the two sequences. The percentage is 
calcxilated by: (a) determining the number of positions at which the identical 
nucleic acid base occurs in both sequences to yield the number of matched 
positions; (b) dividing the number of matched positions by the total number of 
5 positions in the window of comparison; and (c) mxiltiplying the resxilt by 100 to 
yield the percentage of sequence homology. Optimal ahgnment of sequences for 
comparison may be conducted by computerized implementations of known 
algorithms, or by visual inspection. Readily available sequence comparison and 
multiple sequence ahgnment algorithms are, respectively, the Basic Local 

10 Ahgnment Search Tool (BLAST) (Altschul, S.F. et ah 1990. J. Mol. Biol. 
215:403; Altschul, S.F. et al. 1997. Nucleic Acid Res. 25:3389-3402) and 
ClustalW programs both available on the internet. Other suitable programs 
include GAP, BESTFIT and FASTA in the Wisconsin Genetics Software 
Package (Genetics Computer Group (GOG), Madison, WI, USA). 

15 As used herein, the term "substantially homologous" for nucleic acid 

sequences should be interpreted as two nucleic acid sequences having a 
"percentage of sequence homology" of at least 40, preferably at least 60, more 
preferably at least 80, even more preferably at least 90, still more preferably at 
least 95, still more preferably at least 98, and most preferably at least 99 

20 percent nucleotide sequence homology. 

As used herein, the term "human homologue" should be interpreted as a 
himian (putative) gene having the same function as the (putative) gene 
identr&ed in mouse. A "human homologue" refers with respect to its respective 
mtirine gene as described in the present invention to a human gene which 

25 encodes an amino acid sequence which shows more than 45 %, preferably more 
than 60 %, more preferably more than 70 %, still more preferably more than 80 
%, even more preferably more than 90 % and most preferably more than 95 % 
identity with the amino acid sequence derived from the murine genes of the 
invention and is in the case of an enzyme the amino acid sequence of a 

30 polypeptide which shows the same tj^e of enzymatic activity as the enzyme 
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encoded murine genes of the invention. A "human homologue" of a murine 
genomic region, not being a (putative) gene, as defined herein indicates a 
genomic region within the h\mian genome that, when genetically transformed, 
results in an aberrant function of a gene which is identical in human and 
5 mouse. 

As used herein, the terms "antibody and "antibodies" refer to 
monoclonal antibodies, multispecific antibodies, synthetic antibodies, human 
antibodies, humanized antibodies, chimeric antibodies, single-chain Fvs (scFv), 
single chain antibodies, Fab fragments, F(ab') fragments, disulfide-linked Pvs 

10 (sdFv), and anti-idiotypic (anti-Id) antibodies (including, e.g., anti-Id 

antibodies to antibodies of the invention), and epitope-binding fragments of 
any of the above. In particular, antibodies of the present invention include 
immunoglobulin molecules and immunologically active portions of 
immunoglobulin molecxiles, i.e., molecules that contain an antigen binding site 

15 that immxmospecifically binds to a polypeptide antigen encoded by a gene 

comprised in the genomic regions or affected by genetic transformations in the 
genomic regions listed in Table 1. The immunoglobulin molecules of the 
invention can be of any type {e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., 
IgGi, IgG2, IgGs, IgG4, IgAi and IgA2) or subclass of immunoglobulin molecxde. 

20 The term "oligonucleotide array" refers to a substrate having a two- 

dimensional surface having at least two different features. OUgonucleotide 
airrays preferably are ordered so that the locahzation of each feature on the 
siu*face is spotted. In preferred embodiments, an array can have a density of at 
least five hundred, at least one thousand, at least 10 thousand, at least 100 

25 thousand features per square cm. The substrate can be, merely by way of 
example, glass, sihcon, quartz, polymer, plastic or metal and can have the 
thickness of a glass microscope sUde or a glass cover slip. Substrates that are 
transparent to light are useful when the method of performing an assay on the 
chip involves optical detection. As used herein, the term also refers to a probe 

30 array and the substrate to which it is attached that form part of a wafer. The 
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substrate can also be a membrane made of polyester or nylon. In this 
embodiment, the density of features per square cm is comprised between a few 
units to a few dozens. 

"Modulating" according to the present invention should be understood 
5 as regulating, controlling, blocking, inhibiting, stimulating, enhancing, 

activating, mimicking, bypassing, correcting, removing, and/or substituting 
said gene expression or said route, in more general terms, intervening in said 
gene expression or said route. 

"Subject" as used herein includes, but is not limited to, an organism; a 
10 mammal, including, e.g., a human, non-human primate, mouse, pig, cow, goat, 
cat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non- 
human mammal; and a non-mammal, including, e.g., a non-mammahan 
vertebrate, such as a bird (e.g., a chicken or duck) or a fish, and a non- 
mammahan invertebrate. 

15 

LEGEND TO THE FIGURES 

Figure 1 is a graphic representation of a directed PGR on chromosomal 
DNA to determine commonality of inverse PGR identified virus integration site 
as described in Example 3. Herein, primers XI, XIN, X2 and X2N are designed 

20 in a locus or genomic region that was identified as a virus integration site by 
inverse PGR. To amplify flanking genomic sequences and to determine the 
locaUzation and orientation of the integrated provirus, 4 different nested PGRs 
were performed. First, the primers XI and X2 were combined with the Graffi- 
1.4 MuLV LTR primers LI and L2, These products were amplified by a nested 

25 PGR, using the primers XIN and X2N in combination with LIN and L2N. The 
specificity of the amplified bands was checked using probes PI and P2 in 
Southern blot analysis. 

Figure 2 illustrates the method of the present invention for the 
identification of disease loci. IPGR or RT-PCR was performed on RNA or DNA 

30 from cell lines (DA and NFS), or GSL leukemias. The residting virus flanking 
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firagments were subjected to sequence analysis. LTR- and locus-specific 
primers were designed and used in a nested PGR strategy (*), i.e. LTR- 
1/primer-A PGR, followed by a LTR-2/primer-B amplification using genomic 
DNA fi-om a panel of CasBr-M MuLV-induced leukemias. PGR products were 

5 electrophoresed on a 1.5% agarose gel and subsequently blotted. Blots were 
first hybridized with locus-specific probe C and exposed to film, and after 
stripping rebybridized with probe LTR3. Bands hybridizing with the LTR3 as 
well as the locus specific probe G (lanes 1, 2, 3, 6 and 8) were considered 
positive, i.e. tumors carrying this particular common viral integration site 

10 (cVIS). Hybridization with one primer only (lanes 4 and 7) are false positives. 
In each experiment two or three positive fragments were cloned and nucleotide 
sequenced to confirm specificity 

DETAILED DESGRIPTION OF THE INVENTION 

15 As outlined above, knowledge of the identity of genes involved in cancer 

development greatly facilitates the development of prophylactic, therapeutic 
and diagnostic methods for this disease. The discovery of a great number of 
additional cancer genes and potential cancer genes now provides for the tools 
to detect and treat cancer and to provide for prophylactic, therapeutic and 

20 diagnostic compositions useful for treating subjects suffering from cancer or 
diagnosing subjects suspected of having the disease or even diagnosing the 
severity or type of the disease. 

In a preferred embodiment, the present invention is aimed at providing 
methods, substances, compositions, and uses according to the invention for the 

25 treatment of leukemia, even more preferably for the treatment of acute 
myeloid leukemia (AML). 

Leukemia is a type of cancer that originates in the bone marrow where . 
the blood forming cells (hematopoietic stem cells and progenitor cells) in adults 
reside. Lexikemia is an uncontrolled production and accumulation of cancerous 

30 blood cells that have usually partly or completely lost their abihty to develop 
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into differentiated and functional blood cell types. Leukemias can be divided in 
clinically distinct categories with largely different prognosis and requirements 
for the type of treatment. Based on clinical and cytological parameters, 
leukemias are grossly classified in the following categories: chronic myeloid 
leukemia (CML), chronic lymphocytic leukemia (CLL), acute myeloid leukemia 
(AML) and acute lymphoblastic le\ikemia (ALL). The clinico-pathological 
characteristics of these types of leukemia are distinct and reqture different 
forms of treatment with variable aggressiveness and treatment-related 
morbidity and mortality. 

In the past two decades, a further refinement of the above-mentioned 
classification has been developed based on immunological, cytogenetic and, 
more recently, molecular genetic parameters. This refinement has resxilted in 
improvements in treatment selection and a better prediction of the prognosis 
and treatment responses of patients and has formed the basis for the 
categorization of leukemia patients into good risk, standard risk and poor risk 
categories. However, these categories sure stUl heterogeneous and further 
refinement is urgently required. 

In addition, further insights in the pathogenetic and pathophysiological 
mechanisms underlying this heterogeneity is urgently needed as it would 
provide an invaluable source of information for the development of new 
therapies aimed at specific pathogenetic mechanisms. This will not only result 
in reduced mortahty and morbidity with equally efficient therapeutic impact, 
but also in the development of curative therapies for patients that are 
refiractory to the currently available treatment options for leukemia. 

Many of the genes affected by retroviral integration, as described in 
more detail in the examples below, appeared to be related to signaling, some of 
which have already been linked to human leukemia. For instance, in case of 
the Graffi-1.4 MuLV, the Tie-1 gene, which encodes a tyrosine kinase receptor 
that is normally expressed in vascular endothehal cells and hematopoietic 
stem cells, is overexpressed in chronic myeloid letikemia (CML). Importantly, 
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high Tie-1 levels inversely correlate with survival of CML patients in early 
chronic phase. Strongly increased levels of Tie-1 were also detected in bone 
marrow samples from MDS (Myelodysplastic Syndromes) and AML patients. 
Notch-1 is another example of a CIS associated with human disease. The 
5 human homologue of the Notch (Tan-1) gene is involved in chromosomal 

translocation t(7;9)(q34;34.3) which results in a truncated receptor in human 
T-lymphoblastic neoplasms. Notch-1 is a proviral integration site in mouse 
lymphoid leukemias . Our data suggest that aberrant Notch signaling may 
also be involved in myeloid leukemia. Notably, the integrations in the Notch 
10 gene predictably result in the constitutive formation of the trxmcated active 

form of Notch, which interferes with G-CSF-induced myeloid differentiation in 
the 32D cell model. 

Thus, the newly detected involvement of the gene sequences or murine 
genomic regions as disclosed herein, allows for the identification of novel 
15 routes of disease development and for a better imderstanding in the 

cooperative action between genes in pathogenesis of cancer, in particular of 
leukemia and thus for a better diagnosis and treatment of the disease. 

The implications of the findings reported herein are not limited to 
murine cancers. Many of the murine genomic regions identified herein also 
20 play an important role in cancer development in hiunans and other mammals. 
Examples of genes that are involved in mouse as well as human leukemia are 
for instance the tumor suppressor gene Evi2/Nfl (Buchberg et aLy Mol Cell 
Biol, 10, 4658-4666, 1990; Shaimon et al, NEngl J Med, 330, 597-601, 1994; 
Side et aZ., Blood, 92, 267-722, 1998), or the oncogenes Nmyc-1 (Hirvonen et al. 
25 Leuk Lymphoma, 11, 197-205, 1993; Setoguchi et al., Mol Cell Biol, 9, 4515- 
4522, 1989), Evil (Morishita et al Oncogene Res, 5, 221-231, 1990; Morishita 
Cell, 54, 831-840, 1988), Evi6/Hoxa9 (Nakamura et al, Nat Genet, 12, 154-158, 
1996; Nakamura et al, Nat Genet, 12, 149-153, 1996), Bcll/CyclinDl (de Boer 
et al, 1997 Ann Oncol, 8, 109-117; Silver & Buckler, J Virol, 60, 1156- 
30 1158,1986), Erg (Shimizu et al, Proc Natl Acad Sci USA, 90, 10280-10284 
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1993; Valk et al. Nucleic Acids Res, 25, 4419-4421, 1997) or spi-1/Pu.l 
(Mueller BU et aL, Blood 101(5):2074, 2003). An overview of cxirrently known 
murine cancer genes identified in retroviral screens can be found on the 
internet address http://genome2,ncifcr£ gov/RTCGD/ . Ldkewise, it is therefore 
5 predicted that a large number of the genes as disclosed herein is involved in 
cancer development in humans as well. 

Mutations are j&requently observed in genes encoding growth factors, 
growth factor receptors and other membrane proteins, kinases, phosphatases 
and other regtdatory enzymes, transcription regulators and proteins critical in 
10 the process of survival and apoptosis. Multiple "leukemia genes" have been 
demonstrated to be disease genes responsible for the development of certain 
other types of mahgnancies as well, e.g. Ras (Bos JL, et al Blood. 69:1237- 
41,1987; , Gahana C. et al., Mol Carcinog. 14:286-93, 1995; van 't Veer et aL, 
Oncogene 2:157-65, 1988), p55 (Carson DA, Lois A., Lancet. 14; 346(8981): 1009- 
15 11, 1995). NF'l (McLaughlin ME, Jacks T, Methods Mol BioL 222: 223-237, 
2003; Nishi T, Saya H., Cancer Metastasis Rev. 10: 301-310, 1991) or C-kit 
(Rubin BP et al, Cancer Res. 61 :8118-8121, 2001). 

Acute myeloid leukemia (AML) is the most frequent form of acute 
leukemia in adults and is one of the most aggressive forms of leukemia, which 
20 is acutely life threatening imless treated with different kinds of chemotherapy. 
Depending on the AML subtype determined by various clinical parameters, 
including age, and laboratory findings, for instance cs^togenetic features, 
allogeneic stem cell transplantation might foUow the remission induction by 
chemotherapy. The 5 years overall and disease fi:ee sxurvival rate of adult AML 
25 is currently in the order of 35-40%. There is a strong need for a more precise 
diagnosis of AML, which allows for better distinction between the prognostic 
subtypes and for new therapeutic strategies for the large contingent of patients 
that can not be cxired to date. The currently available laboratory techniques 
allow for a prognostic classification, but this is still far from optimal. Still, 
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most patients cannot satisfactorily be risk-stratified and still a majority of 
patients are not c\xred by currently available treatment modalities. 

The pathogenesis of leukemia is complex. Before becoming clinically 
overt, leukemic cells have acquired multiple defects in regulatory genes that 
5 control normal blood cell production. In human leukemia, until now only few of 
these genes have been identified, mainly by virtue of the fact that these genes 
were located in critical chromosomal regions involved in specific chromosome 
translocations found in human AML. Studies in mice, particularly those 
involving retroviral tagging, have yielded only relatively small numbers of 

10 retroviral insertions and target genes per study, but have nonetheless made 
clear that there are at least a few hundred genes that can be involved in the 
pathogenesis of murine leukemia. Based on the strong conservation between 
the mouse and human hematopoietic systems, as is for instance evident firom 
the fact that the biological properties of the hematopoietic progenitor cells and 

15 the regulators (hematopoietic growth factors) are largely similar, it is very 
likely that a substantial proportion of the genes (or their close family 
members) impHcated in mouse leukemia, can also be involved in the 
development of human disease. 

In order to substantiate the claim that the presently disclosed genes 

20 play an important role in human leukemia we investigated 288 cases of human 
AML. These consisted of AML cases with internal tandem dupHcations in the 
gene encoding the tyrosine kinase receptor FLT3 (FLT3-ITD), which feature is 
associated with a poor prognosis, as weU as AML cases with a t(8;21) 
translocation or patients with a t(15;17) translocation, which features fall 

25 within the favourable risk categories. For 126 of the 237 genes as disclosed 
herein, a study for their potential involvement said 288 cases of human AML 
was performed. The study included the determinations of 1) whether the genes 
could be detected, 2) whether the expression of the genes deviated significantly 
from expression levels in normal human bone marrow cells, in particular the 

30 immatxire (CD 34+) subset, and 3) to what extent these genes might be 
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differentially expressed in the most significant prognostic subgroups of AML 
that are defined to date. The results of these determinations indicated that the 
genes as disclosed herein provide for additional diagnostic power in detecting 
as well as in differentiating AML. 
5 Although a large number of the murine genomic regions as disclosed in 

the present invention, i.e. in Table 1, encode known genes, i.e, encode for 
known murine genes, the relevance of these genes in the development of 
myeloid leukemia was not previously recognized. 

Furthermore, the remainder of the murine genomic regions involved in 
10 tumor development as disclosed in the present invention comprise unknown 
nucleic acid sequences, i.e. do not show any significant homology to sequences 
that encode known murine genes as available in present-day databases and 
comprise i.a. putative gene sequences. Nonetheless, the involvement of these 
murine genomic regions in the development of myeloid leukemia was 
15 demonstrated for all nucleic acid sequences as hsted in Table 1. 

The finding that these murine genomic regions are involved in the 
development of myeloid leukemia does not preclude their role in other forms of 
leukemia or even other cancers. Therefore, the murine genomic regions as 
hsted in Table 1 may also play an important role in cancers such as bone 
20 cancers, brain tumors, breast cancer, endocrine system cancers, 

gastrointestinal cancers, gynaecologic cancers, head and neck cancers, lung 
cancers, lymphomas, metastases, myelomas, pediatric cancers, penile cancer, 
prostate cancer, sarcomas, skin cancers, testicxilar cancer, thyroid cancer and 
urinary tract cancers. 
25 The present invention provides in one aspect for nucleic acid sequences 

suitable for therapeutic and/or diagnostic use. 

Murine genomic regions disclosed in the present invention are useful for 
developing therapeutic and diagnostic methods. For example, a diagnostic 
assay to determine the stage of the disease may be developed and may also be 
30 useful in tailoring the treatment of aggressive versus milder cancer or tumor 
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progression. For instance it is known that the t(15;17) translocation restdts in 
a mutation in the gene encoding a all-trans retinoic acid (ATRA) receptor in 
differentiating blood cells and residts in an impaired ATRA receptor activity, 
required for proper differentiation. As a result, differentiation of the cells is 
impaired leading to lexikemia. A well accepted treatment is cxirrently the 
administration of high doses of ATRA in order to allow the cells to 
differentiate- This demonstrates that knowledge of the molecular mechanism 
of cancer may provide important implications for therapy. 

The murine genomic regions, genes comprised therein and polj^eptides 
encoded thereby as well as genes and their encoded polypeptides affected by 
transformations in murine genomic regions of the invention are useful for 
designing diagnostic reagents useful in such methods. 

Further, modulation of genes or gene expression products that are mis- 
regulated can be used to treat or ameUorate cancer, tumor progression, 
hyperprohferative cell growth, and the accompanying physical and biological 
manifestations. For example, the murine genomic regions provided herein, can 
be used to construct nucleic acid and polypeptide compositions useful for 
treatment, such as antisense, ribozymes, antibodies, vaccine antigens, and 
immune system inducers, for example. Thus, the present invention relates to 
methods and reagents for diagnosis, and to methods and compositions for 
treatment. 

In one aspect, use is provided of nucleic acids having a sequence of one 
or more of the cancer-related genes as disclosed herein to obtain full-length 
cDNA and full-length human gene and promoter regions. One such use is the 
generation of the polypeptides encoded by the gene sequence. 

Polypeptide production. 

The polynucleotide, the corresponding cDNA, or the full-length gene 
encoded in a murine genomic region of the invention may be identified by 
using computer algorithms for the identification of open reading frames 
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(ORFs) therein. Subsequently, full length cDNAs comprising the complete or 
partial coding sequences may be prepared synthetically and ligated into a 
stiitable expression system to yield a polynucleotide construct of the invention. 
Appropriate polyTiucleotide constructs are purified using standard 
5 recombinant DNA techniques as described in, for example, Sambrook et al., 
(1989) Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold Spring Harbor 
Press, Cold Spring Harbor, N.Y.). The polypeptides encoded by the 
polynudeotides are expressed in any e3q)ression system, including, for 
example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable 

10 vectors and host cells are described in U.S. Pat. No. 5,654,173. 

Expression systems in bacteria include those described in Chang et al.. 
Nature (1978) 275:615, Goeddelet al.. Nature (1979) 281:544, Goeddel et al.. 
Nucleic Acids Res. (1980) 8:4057; EP 0 036,776, U.S. Pat. No. 4,551,483, 
DeBoer et al., Proc. Natl. Acad. Sci. (USA) (1983) 80:21-25, and Siebenlist et 

15 al., CeU (1980) 20:269. 

Expression systems in yeast include those described in Hinnen et al., 
Proc. Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et aL, J. Bacteriol. (1983) 
153:163; Kurtz et al., Mol. CeU. Biol. (1986) 6:142; Kunze et al., J. Basic 
Microbiol. (1985) 25:141; Gleeson et al., J. Gen. Microbiol. (1986) 132:3459, 

20 Roggenkamp et al., Mol. Gen. Genet. (1986) 202:302) Das et al., J. Bacteriol. 

(1984) 158:1165; De Louvencourt et al., J. Bacteriol. (1983) 154:737, Van den 
Berg et al., Bio/Technology (1990) 8:135; Kunze et al., J. Basic Microbiol. 

(1985) 25:141; Cregg et al., Mol. Cell. Biol. (1985) 5:3376, U.S. Pat. Nos. 
4,837,148 and 4,929,555; Beach and Nxurse, Nature (1981) 300:706; Davidow et 

25 al., Curr. Genet. (1985) 10:380, Gaillardin et al., Curr. Genet. (1985) 10:49, 

Ballance et al., Biochem. Biophys. Res. Commtm. (1983) 112:284-289; Tilbum 
et al., Gene (1983) 26:205-221, Yelton et al., Proc. Natl. Acad Sci. (USA) (1984) 
81:1470-1474, KeUy and Hynes, EMBO J. (1985) 4:475479; EP 0 244,234. and 
WO 91/00357. 
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E:q)ression of heterologous genes in insects is accomplished as described 
in U.S. Pat. No. 4,745,051, Friesen et al. (1986) "The Regulation of Baculovirus 
Gene Expression" in: The Molecular Biology Of Bacxxloviruses (W. DoerQer, 
ed.), EP 0 127,839, EP 0 155,476, and Vlak et al., J. Gen. ViroL (1988) 69:765- 
5 776, Miller et al., Ann. Rev. Microbiol. (1988) 42:177, Carbonell et al.. Gene 
(1988) 73:409, Maeda et al., Nature (1985) 315:592-594, Lebacq-Verheyden et 
al., Mol. Cell. Biol. (1988) 8:3129; Smith et al., Proc. Natl. Acad. Sci. (USA) 
(1985) 82:8404, Miyajima et al., Gene (1987) 58:273; and Martin et al., DNA 
(1988) 7:99, Nximerous baculoviral strains and variants and corresponding 

10 permissive insect host cells from hosts are described in Luckow et al., 

Bio/Technology (1988) 6:47-55, Miller et al.. Generic Engineering (Setlow, J. K. 
et al. eds.), Vol. 8 (Plenum PubUshing, 1986), pp. 277-279, and Maeda et al., 
Nature, (1985) 315:592-594. 

Mammalian expression is accomplished as described in Dijkema et al., 

15 EMBO J. (1985) 4:761, Gorman et al., Proc. Natl. Acad. Sci. (USA) (1982) 

79:6777, Boshart et al.. Cell (1985) 41:521 and U.S. Pat. No. 4,399,216. Other 
features of mammahan expression are facihtated as described in Ham and 
Wallace, Meth. Enz. (1979) 58:44, Barnes and Sato, Anal. Biochem. (1980) 
102:255, U.S. Pat. Nos. 4,767,704, 4,657,866, 4,927,762, 4,560,655, WO 

20 90/103430, WO 87/00195, and U.S. Pat. No. RE 30,985. 

Polynucleotide molecules or parts thereof comprising the murine 
genomic regions Usted in Table 1 or comprised therein are propagated by 
placing the molecule in a vector. Viral and non-viral vectors are used, 
including plasmids. The choice of plasmid will depend on the type of cell in 

25 which propagation is desired and the purpose of propagation. Certain vectors 
are useful for amplifying and making large amounts of the desired DNA 
sequence. Other vectors are suitable for expression in cells in culture. Still 
other vectors are suitable for transfer and expression in cells in a whole animal 
or person. The choice of appropriate vector is well within the skill of the art. 

30 Many such vectors are available commercially. The polynucleotide is inserted 
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into a vector typically by means of DNA ligase attachment to a cleaved 
restriction enzyme site in the vector. Alternatively, the desired nucleotide 
sequence may be inserted by homologous recombination in vivo. Typically this 
is accomplished by attaching regions of homology to the vector on the flanks of 
5 the desired nucleotide sequence. Regions of homology are added by hgation of 
oUgonudeotides, or by pol3mierase chain reaction using primers comprising 
both the region of homology and a portion of the desired nucleotide sequence, 
for example. 

Polynucleotides are hnked to regulatory sequences as appropriate to 
10 obtain the desired expression properties. These may include promoters 

(attached either at the 5' end of the sense strand or at the 3' end of the 

antisense strand), enhancers, terminators, operators, repressors, and inducers. 

The promoters may be regulated or constitutive. In some situations it may be 

desirable to use conditionally active promoters, such as tissue-specific or 
15 developmental stage-specific promoters. These are linked to the desired 

nucleotide sequence using the techniques described above for linkage to 

vectors. Any techniques known in the art may be used. 

When any of the above host cells, or other appropriate host cells or 

organisms, are used to replicate and/or express the polynucleotides or nucleic 
20 acids of the invention, the resulting rep heated nucleic acid, RNA, expressed 

protein or polypeptide, is within the scope of the invention as a product of the 

host cell or organism. The product is recovered by any appropriate means 

known in the art. 

Once the gene corresponding to a polypeptide is identified, its 
25 expression can be regulated in the ceU to which the gene is native. For 

example, an endogenous gene of a cell can be regulated by an exogenous 

regulatory sequence as disclosed in U.S. Pat. No. 5,641,670, "Protein 

Production and Protein Dehvery." 

30 Identification of Secreted and Membrane-Bound Polypeptides 
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Both secreted and membrane-bound polypeptides encoded in miirine 
genomic regions or affected thereby in any way are of interest. For example, 
levels of secreted polypeptides can be assayed conveniently in body fluids, such 
as blood or urine. Membrane-bound polypeptides are useful for constructing 
5 vaccine antigens, for inducing an immune response, or for diagnosis of whole 
cells. Such antigens would comprise aU or part of the extracellular region of 
the membrane-bound pol3T)eptides. 

Because both secreted and membrane-bound poljrpeptides comprise a 
fragment of contiguous hydrophobic amino acids, hydrophobicity predicting 
10 algorithms can be used to identify such polypeptides. 

A signal sequence is usually encoded by both secreted and membrane- 
boimd polypeptide genes to direct a pol5rpeptide to the sxirface of the ceU. The 
signal sequence usually comprises a stretch of hydrophobic residues. Such 
signal sequences can fold into helical structures. 
15 Membrane-bound pol3Tpeptides typically comprise at least one 

transmembrane region that possesses a stretch of hydrophobic amino acids 
that can transverse the membrane. Some transmembrane regions also exhibit 
a hehcal structure. 

Hydrophobic fragments within a pol3T)eptide can be identified by using 
20 computer algorithms. Such algorithms include Hopp & Woods, Proc. Natl. 
Acad. Sci. USA 78: 3824-3828 (1981); Kyte & Doolittle, J. MoL Biol. 157:105- 
132 (1982); and RAOAR algorithm, Degli Esposti et al., Eur. J. Biochem. 190: 
207-219 (1990). 

Another method of identifying secreted and membrane-boimd 
25 polypeptides is to translate the present polynucleotides, listed in Table 1, and 
determine if at least 8 contiguous hydrophobic amino acids are present. Those 
translated polypeptides with at least 8; more typically, 10; even more typically, 
12 contiguous hydrophobic amino acids are considered to be either a putative 
secreted or membrane bound polj^eptide. Hydrophobic amino acids include 
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alanine, glycine, histidine, isoleucine, leucine, lysine, methionine, 
phenylalanine, proline, threonine, tryptophan, tyrosine, and valine. 

Construction of Polypeptides of the Invention and Variants Thereof 
5 The poljrpeptides useful for the development of therapeutic reagents 

include those encoded by the nucleotide sequences of genes or putative genes 
as disclosed herein. These nucleotide sequences can also be encoded by nucleic 
acids that, by virtue of the degeneracy of the genetic code, are not identical in 
sequence to the disclosed nucleotide sequences. Thus, the invention includes 
10 within its scope isolated nucleic acids comprising a nucleotide sequence 
encodiag a protein or polypeptide expressed by said isolated nucleic acid 
having the sequence of any one of the gene sequences listed in Table 1 or 
having a sequence substantially homologous thereto. Also within the scope of 
the invention are variants; variants of polypeptides include mutants, 
15 fragments, and fusions. Mutants can include amino acid substitutions, 

additions or deletions. The amino acid substitutions can be conservative amino 
acid substitutions or substitutions to eUminate non-essential amino acids, such 
as to alter a glycosylation site, a phosphorylation site or an acetylation site, or 
to minimize misfolding by substitution or deletion of one or more cysteine 
20 residues that are not necessary for function. Conservative amino acid 
substitutions are those that preserve the general charge, 

hydrophobicity/hydrophihcity, and/or steric bvdk of the amino acid substituted. 
For example, substitutions between the following groups are conservative: 
Gly/Ala, Val/Ile/Leu, Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys,Thr, and 
25 Phe/Trp/Tyr. The genetic code can be used to select the appropriate codons to 
construct the corresponding variants. 

Small molecule inhibitors 

Small molecule inhibitors are usually chemical entities that can be 
30 obtained by screening of already existing hbraries of compounds or by 
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designing compounds based on the structure of the protein encoded by a gene 
involved in tumor development. Briefly, the structure of at least a fragment of 
the protein is determined by either Nuclear Magnetic Resonance or X-ray 
crystallography. Based on this structure, a virtual screening of compounds is 
5 performed. The selected compounds are synthesized using medicinal and/or 
combinatorial chemistry and thereafter analyzed for their inhibitory effect on 
the protein in vitro and in vivo. This step can be repeated until a compound, is 
selected with the desired inhibitory effect. After optimization of the compoxmd, 
its toxicity profile and efficacy as cancer therapeutic is tested in vivo using 

10 appropriate animal model systems. 

Differentially expressed genes that do not encode membrane-boimd 
proteins are selected as targets for the development of small molecule 
inhibitors. To identify putative binding sites or pockets for small molecules on 
the surface of the target proteins, the three-dimensional structure of those 

15 targets are determined by standard crystaUization techniques (de Vos et al. 
1988, Science 239:888-93; WiQiams et aL 2001, Nat Struct Biol 8:838-42). 
Additional mutational analysis may be performed to confirm the functional 
importance of the identified binding sites. Subsequently, Cerius2 (Molecular 
Simxilations Inc., San Diego, CA, USA) and Ludi/ACD (Accelrys Inc., San 

20 Diego, CA, USA) software is used for virtual screening of small molecule 

hbraries (Bohm. 1992 J Comp Aided Molec Design 6:61-78). The compoxmds 
identified as potential binders by these programs are synthesized by 
combinatorial chemistry and screened for binding affinity to the targets as well 
as for their inhibitory capacities of the target protein's function by standard in 

25 vitro and in vivo assays. In addition to the rational development of novel small 
molecules, existing small molecule compound libraries are screened using 
these assays to generate lead compounds. Lead compounds identified are 
subsequently co-crystaUized with the target to obtain information on how the 
binding of the small molecTile can be improved (Zeslawska et al. 2000, J Mol 

30 Biol 301:465-75). Based on these findings, novel compounds are designed. 
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synthesized, tested, and co-crystallized. This optimization process is repeated 
for several rounds leading to the development of a high-affinity compoimd of 
the invention that successfully inhibits the function of its target protein. 
Finally, the toxicity of the compound is tested using standard assays 
5 (commercially available service via MDS Pharma Services, Montreal, Quebec, 
Canada) after which it is screened in an animal model system. 

Ribozymes 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules 
10 possessing endoribonuclease activity. Ribozymes are specifically designed for a 
particular target, and the target message must contain a specific nucleotide 
sequence. They are engineered to cleave any RNA species site-specifically in 
the background of cellular RNA. The cleavage event renders the mRNA 
tmstable and prevents protein expression. Importantly, ribozymes can be used 
15 to inhibit expression of a gene of unknown function for the purpose of 

determining its function in an in vitro or in vivo context, by detecting the 
phenotypic effect. 

One commonly used ribozyme motif is the hammerhead, for which the 
substrate sequence requirements are minimal. Design of the hammerhead 

20 ribozyme is disclosed in Usman et aL, Cxirrent Opin. Struct. Biol. (1996) 6:527- 
533. Usman also discusses the therapeutic uses of ribozymes. Ribozymes can 
also be prepared and used as described in Long et al., FASEB J. (1993) 7:25; 
Symons, Ann. Rev. Biochem, (1992) 61:641; Perrotta et al., Biochem. (1992) 
31:16-17; Ojwang et al., Proc. Natl. Acad. ScL (USA) (1992) 89:10802-10806; 

25 and U.S. Pat. No. 5,254,678. Ribozyme cleavage of HIV-I RNA is described in 
U.S. Pat. No. 5,144,019; methods of cleaving RNA using ribozymes is described 
in U.S. Pat. No. 5,116,742; and methods for increasing the specificity of 
ribozymes are described in U.S. Pat. No. 5,225,337 and Koiz\mii et al., Nucleic 
Acid Res. (1989) 17:7059-7071. Preparation and use of ribozyme firagments in a 

30 hammerhead structvire are also described by Koizimii et al.. Nucleic Acids Res. 



wo 2004/016317 



►CT/NL2003/000583 



34 

(1989) 17:7059-7071. Preparation and use of ribozyme fragments in a hairpin 
structure are described by Chowrira and Burke, Nucleic Acids Res. (1992) 
20:2835. Ribozsnnes can also be made by rolling transcription as described in 
Daubendiek and Kool, Nat. Biotechnol. (1997) 15(3):273-277. 
5 The hybridizing region of the ribozyme may be modified or may be 

prepared as a branched structure as described in Horn and Urdea, Nucleic 
Acids Res. (1989) 17:6959-67. The basic structure of the ribozymes may also be 
chemically altered in ways familiar to those skilled in the art, and chemically 
synthesized ribozymes can be administered as synthetic oligonucleotide 

10 derivatives modified by monomeric units. In a therapeutic context, hposome 
mediated dehvery of ribozymes improves cellular uptake, as described in 
Birikh et al., Eur. J. Biochem. (1997) 245:1-16. 

Therapeutic and functional genomic applications of ribozymes proceed 
begizmlng with knowledge of a portion of the coding sequence of the gene to be 

15 inhibited. Thus, for many genes, a nucleic acid sequence provides adequate 
sequence for constructing an effective ribozyme, A target cleavage site is 
selected in the target sequence, and a ribozyme is constructed based on the 5' 
and 3* nucleotide sequences that flank the cleavage site. Retroviral vectors are 
engineered to express monomeric and mtiltimeric hammerhead ribozjones 

20 targeting the mRNA of the target coding sequence. These monomeric and 
multimeric ribozymes are tested in vitro for an ability to cleave the target 
mRNA. A cell line is stably transduced with the retroviral vectors expressing 
the ribozymes, and the transduction is confirmed by Northern blot analysis 
and reverse-transcription poljonerase chain reaction (RT-PCR). The cells are 

25 screened for inactivation of the target mRNA by such indicators as reduction of 
expression of disease markers or reduction of the gene product of the target 
mRNA. 

Antisense 
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Antisense polynucleotides are designed to specifically bind to RNA, 
resulting in the formation of RNA-DNA or RNA-RNA hybrids, with an arrest 
of DNA replication, reverse transcription or messenger RNA translation. 
Antisense polynucleotides based on a selected sequence can interfere with 
5 expression of the corresponding gene. 

Antisense polynucleotides are typically generated within the cell by 
expression ficom antisense constructs that contain the antisense strand as the 
transcribed strand. Antisense polynucleotides will bind and/or interfere with 
the translation of the corresponding mRNA. As such, antisense may be used 
10 therapeutically the inhibit the expression of oncogenes. 

Antisense RNA or antisense oHgodeoxynucleotides (antisense ODNs) 
can both be used and may also be prepared in vitro synthetically or by means 
of recombinant DNA techniques. Both methods are well within the reach of the 
person skUled in the art. ODNs are smaller than complete antisense RNAs and 
15 have therefore the advantage that they can more easily enter the target cell. In 
order to avoid their digestion by DNAse, ODNs and antisense RNAs may be 
chemically modified. For targeting to the desired target cells, the molecules 
may be hnked to hgands of receptors found on the target cells or to antibodies 
directed against molec\iles on the surface of the target cells. 
20 Antisense therapy for a variety of cancers is in clinical phase and has 

been discussed extensively in the hterature. Reed reviewed antisense therapy 
directed at the Bcl-2 gene in tumors; gene transfer-mediated overexpression of 
Bcl-2 in tumor ceU lines conferred resistance to many types of cancer drugs. 
(Reed, J. C, N.C.L. (1997) 89:988-990). The potential for clinical development 
25 of antisense inhibitors of ras is discussed by Cowsert, L. M., Anti-Cancer Drug 
Design (1997) 12:359-371. Additional important antisense targets include 
leukemia (Geurtz, A. M., Anti-Cancer Drug Design (1997) 12:341-358); human 
C-ref kinase (Mania, B. P., Anti-Cancer Drug Design (1997) 12:327-339); and 
protein kinase C (McGraw et aL, Anti-Cancer Drug Design (1997) 12:315-326. 
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Given the extensive backgroxmd literature and clinical experience in 
antisense therapy, one skilled in the art can use selected nucleic acids of the 
invention as additional potential therapeutics. 

RNAi 

RNAi refers to the introduction of homologous double stranded RNA to 
specifically target the transcription product of a gene, resulting in a null or 
hypomorphic phenotype. RNA interference reqtures an initiation step and an 
effector step. In the first step, input double -stranded (ds) RNA is processed 
into nucleotide 'guide sequences'. These may be single- or double-stranded. The 
guide RNAs are incorporated into a nuclease complex, called the RNA-induced 
silencing complex (RISC), which acts in the second effector step to destroy 
mRNAs that are recognized by the guide RNAs through base-pairing 
interactions. RNAi molecules are thus double stranded RNAs (dsRNAs) that 
are very potent in silencing the expression of the target gene. The invention 
provides dsRNAs complementary to the genes listed in Table 1. 

The ability of dsRNA to suppress the expression of a gene corresponding 
to its own sequence is also called post-transcriptional gene silencing or PTGS. 
The only RNA molecules normally found in the cytoplasm of a cell are 
molecules of single -stranded mRNA. If the cell finds molecules of double- 
stranded RNA, dsRNA, it uses an enzyme to cut them into fii-agments 
containing in general 21-base pairs (about 2 turns of a double helix). The two 
strands of each fragment then separate enough to expose the antisense strand 
so that it can bind to the complementary sense sequence on a molecule of 
mRNA. This triggers cutting the mRNA in that region thus destroying its 
ability to be translated into a polypeptide. Introducing dsRNA corresponding 
to a particular gene will knock out the cell's endogenous expression of that 
gene. This can be done in particular tissues at a chosen time. A possible 
disadvantage of simply introducing dsRNA fragments into a cell is that gene 
expression is only temporarily reduced. However, a more permanent solution 
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is provided by introducing into the cells a DNA vector that can continuously 
synthesize a dsRNA corresponding to the gene to be suppressed. 

RNAi molecules are prepared by methods well known to the person 
skilled in the art. In general an isolated nucleic acid sequence comprising a 
nucleotide sequence which is substantially homologous to the sequence of at 
least one of the murine genomic regions Hsted in Table 1 and which is capable 
of forming one or more transcripts able to form a partially of fully double 
stranded (ds) RNA with (part of) the transcription product of said murine 
cancer genes will function as an RNAi molecule. The double stranded region 
may be in the order of between 10-250, preferably 10-100, more preferably 20- 
50 nucleotides in length. 

RNAi molecules are preferably expressed from recombinant vectors in 
transduced host cells, hematopoietic stem cells being very suitable thereto. 

Dominant Negative Mutations 

Dominant negative mutations are readily generated for corresponding 
proteins that are active as multimers. A mutant polypeptide will interact with 
wild-t3npe polsrpeptides (made from the other aUele) and form a non-functional 
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic 
domain, or a cellular localization domain. Preferably, the mutant polypeptide 
will be overproduced. Point mutations are made that have such an effect. In 
addition, fusion of different polypeptides of various lengths to the terminus of a 
protein can yield dominant negative mutants. General strategies are available 
for making dominant negative mutants. See Herskowitz, Nature (1987) 
329:219-222. Such a technique can be used for creating a loss of function 
mutation, which is useful for determining the function of a protein. 

Detection probes and amplification primers 

Polynucleotide probes and primers comprising at least 8, preferably at 
least 10, more preferably at least 12 contiguous nucleotides selected from the 
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nucleotide sequence of a murine genomic region listed in Table 1 or selected 
£rom the nucleotide sequence of genes affected by transformations in said 
murine genomic regions or complements thereof are used for a variety of 
purposes, including identification of hximan chromosomes and determining 
5 transcription levels of these genes. 

The nucleotide probes are labeled, for example, with a radioactive, 
fluorescent, biotinylated, or chemiluminescent label, and detected by well 
known methods appropriate for the particular label selected. Protocols for 
hybridizing nucleotide probes to preparations of metaphase chromosomes are 

10 also well known in the art. A nucleotide probe will hybridize specifically to 
nucleotide sequences in the chromosome preparations which are 
complementary to the nucleotide sequence of the probe. A probe that 
hybridizes specifically to a polynucleotide should provide a detection signal at 
least 5-, 10-, or 20-fold higher than the backgrovmd hybridization provided 

15 with other unrelated sequences. 

Nucleotide probes are used to detect expression of a gene corresponding 
to the polynucleotide. For example, in Northern blots, mRNA is separated 
electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRNA species of a particular size. The amount of 

20 hybridization is quantitated to determine relative amounts of expression, for 
example imder a particular condition. 

Probes are also used to detect products of amplification by polymerase 
chain reaction. The products of the reaction are hybridized to the probe and 
hybrids are detected. Probes are used for in situ hybridization to cells to detect 

25 expression or to perform chromosome painting. Probes can also be used in vivo 
for diagnostic detection of hybridizing sequences. Probes are typically labeled 
with a fluorescent or luminescent label. Other types of detectable labels may 
be used such as chromophores, radioactive isotopes, and enzymes. 

Expression of specific mRNA can vary in different cell types and can be 

30 tissue specific. This variation of mRNA levels in different ceU types can be 
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exploited with nucleic acid probe assays to determine disease stages. For 
example, PGR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes substantially identical or complementary to nucleic acid 
sequences of murine genomic regions listed in Table 1 or of genes affected by 
transformations therein can determine the presence or absence of cDNA or 
mRNA related to the polynucleotides of the invention. 

Examples of a nucleotide hybridization assay are described in Urdea et 
al., PCT WO92/02526 and Urdea et al., U.S. Pat. No. 5,124,246, both 
incorporated herein by reference. The references describe an example of a 
sandwich nucleotide hybridization assay. 

Alternatively, the Polymerase Chain Reaction (PGR) is another means 
for detecting small amounts of target nucleic acids, as described in MuUis et 
al., Meth. Enzymol. (1987) 155:335-350; U.S. Pat. No. 4,683,195; and U.S. Pat. 
No. 4,683,202, all incorporated herein by reference. Two primer 
polynucleotides nucleotides hybridize with the target nucleic acids and are 
used to prime the reaction. The primers may be composed of sequence within 
or 3' and 5' to the nucleic acid sequences of mxirine genomic regions listed in 
Table 1 or of genes affected by transformations therein. Alternatively, if the 
primers are 3' and 5* to these nucleic acid sequences, they need not hybridize 
to them or the complements. Yet, they may for instance also hybridize to 
sequences flanking the nucleic acid sequences of murine genomic regions hsted 
in Table 1 or of genes affected by transformations therein. A thermostable 
polymerase creates copies of target nucleic acids from the primers using the 
original target nucleic acids as a template. After a large amount of target 
nucleic acids is generated by the polymerase, it is detected by methods such as 
Southern blots. When using the Southern blot method, the labeled probe will 
hybridize to a nucleic acid sequence of a murine genomic region Hsted in Table 
1 or of a gene affected by a transformation therein or complement. 

Furthermore, mRNA or cDNA can be detected by traditional blotting 
techniques described in Sambrook et al., "Molecular Cloning: A Laboratory 
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Manual" (New York, Cold Spring Harbor Laboratory, 1989). mRNA or cDNA 
generated &om raRNA using a pol3nnerase enzyme can be ptirified and 
separated using gel electrophoresis. The nucleic acids on the gel are then 
blotted onto a solid support, such as nitrocellulose. The solid support is 

5 exposed to a labeled probe and then washed to remove any unhybridized probe. 
Next, the duplexes containing the labeled probe are detected. Typically, the 
probe is labeled with radioactivity. 

Additionally, mRNA or cDNA can be detected by using probe arrays. 
mRNA is obtained for instance from tumor tissue and cDNA is generated 

0 therefrom by reverse PGR. The cDNA is optionally labeled by incorporation of 
fluorescently labeled nucleotide analogues therein and is hybridized to the 
probe array. Expression levels are determined from the intensity of the 
fluorescent signal obtained from the corresponding or respective probe sites on 
the array. 



Use of Polynucleotides to Raise Antibodies 

The present invention provides in one aspect for antibodies suitable for 
therapeutic and/or diagnostic use. 

Therapeutic antibodies include antibodies that can bind specifically to 
the expression products of the genes (partly) encoded in the nucleic acid 
sequences of murine genomic regions listed in Table 1 or of genes affected by 
transformations in these regions. By binding directly to the gene products, the 
antibodies may influence the function of their targets by, for example, in the 
case of proteins, steric hindrance, or by blocking at least one of the functional 
domains of those proteins. As such, these antibodies may be used as inhibitors 
of the function of the gene product. Such antibodies may for instance be 
generated against functionally relevant domains of the proteins and 
subsequently screened for their ability to interfere with the target's function 
using standard techniques and assays (see for instance Schwartzberg, 2001, 
Grit Rev Oncol Hematol 40:17-24; Herbst et at. Cancer 94:1593-611, 2002). 
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Alternatively, anti-RNA antibodies may for instance be useful in 
silencing messengers of the timior-related genes of the present invention. In 
another alternative, antibodies may also be used to influence the function of 
their targets indirectly, for instance by binding to members of signaling 
5 pathways in order to influence the function of the targeted proteins or nucleic 
acids. In yet another alternative, therapeutic antibodies may carry one or more 
toxic compounds that exert their effect on the target or target cell by virtue of 
the binding of the carrying antibody thereto. 

For diagnostic purposes, antibodies similar to those above, preferably 

10 those that are capable of binding to the expression products of the genes of the 
present invention may be used, and that are provided with detectable labels 
such as fluorescent, luminescent, or radio-isotope labels in order to allow the 
detection of the gene product. Preferably such diagnostic antibodies are 
targeted to proteinaceous targets present on the outer envelop of the cell, such 

15 as membrane bound target proteins. The use of antibodies for the diagnosis of 
specific types of cancer is known to the skilled person (Sjrrigos et ah 1999, 
Hybridoma 18:219-24). 

The antibodies used in the present invention may be firom any animal 
origin including birds and mammals (e.^., human, murine, donkey, sheep, 

20 rabbit, goat, gxiinea pig, camel, horse, or chicken). Preferably, the antibodies of 
the invention are human or hxmianized monoclonal antibodies. As used herein, 
"himian" antibodies include antibodies having the amino acid sequence of a 
human immimoglobulin and include antibodies isolated &om human 
immunoglobulin libraries (including, but not limited to, synthetic libraries of 

25 immimoglobulin sequences homologous to human immunoglobulin sequences) 
or firom mice that express antibodies from human genes. 

For some uses, including in vivo therapeutic or diagnostic use of 
antibodies in humans and in vitro detection assays, it may be preferred to use 
human or chimeric antibodies. Completely human antibodies are particularly 

30 desirable for therapeutic treatment of human subjects. Himian antibodies can 
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be made by a variety of methods known in the art including phage display 
methods described above using antibody hbraries derived &om human 
immunoglobulin sequences or synthetic sequences homologous to human 
immunoglobulin sequences. See also U.S. Patent Nos. 4,444,887 and 4,716,111; 
5 and PCT publications WO 98/46645, WO 98/50433, WO 98/24893 and 

W098/16654, each of which is incorporated herein by reference in its entirety. 

The antibodies to be used with the methods of the invention include 
derivatives that are modified, Le, by the covalent attachment of any type of 
molecule to the antibody such that covalent attachment. Additionally, the 
10 derivative may contain one or more non-classical amino acids. 

In certain embodiments of the invention, the antibodies to be used with 
the invention have extended half-hves in a mammal, preferably a human, 
when compared to unmodified antibodies. Antibodies or antigen-binding 
fragments thereof having increased in vivo half-lives can be generated by 
15 techniques known to those of skill in the art (see, e.g., PCT PubUcation No. WO 
97/34631). 

In certain embodiments, antibodies to be used with the methods of the 
invention are single-chain antibodies. The design and construction of a single- 
chain antibody is described in Marasco et al, 1993, Proc Natl Acad Sci 90:7889- 
20 7893, which is incorporated herein by reference in its entirety. 

In certain embodiments, the antibodies to be used with the invention 
bind to an intraceUxilar epitope, i.e,, are intrabodies. An intrabody comprises at 
least a portion of an antibody that is capable of immunospecifically binding an 
antigen and preferably does not contain sequences coding for its secretion. 
25 Such antibodies will bind its antigen intracellularly. In one embodiment, the 
intrabody comprises a single-chain Fv ("sPv''). For a review of sPv see 
Pluckthun in The Pharmacology of Monoclonal Antibodies y vol. 113, Rosenburg 
and Moore eds. Springer- Verlag, New York, pp. 269-315 (1994). In a further 
embodiment, the intrabody preferably does not encode an operable secretory 
30 sequence and thus remains within the cell (see generally Marasco, WA, 1998, 
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"Intrabodies: Basic Research and Clinical Gene Therapy Applications** 
Springer:New York). 

Generation of intrabodies is well-known to the skilled artisan and is 
described for example in U.S. Patent Nos. 6,004,940; 6,072,036; 5,965,371, 
5 which are incorporated by reference in their entireties herein. 

In one embodiment, intrabodies are e^ressed in the cytoplasm. In other 
embodiments, the intrabodies are localized to various intracellular locations. 
In such embodiments, speci&c localization sequences can be attached to the 
intranucleotide polypepetide to direct the intrabody to a specific location. 
10 The antibodies to be used with the methods of the invention or 

fragments thereof can be produced by any method known in the art for the 
synthesis of antibodies, in particular, by chemical synthesis or preferably, by 
recombinant expression techniques. 

Monoclonal antibodies can be prepared using a wide variety of 
16 techniques known in the art including the use of hybridoma, recombinant, and 
phage display technologies, or a combination thereof. For example, monoclonal 
antibodies can be produced using hybridoma techniques including those known 
in the art auid taught, for example, in Harlow et aZ., Antibodies: A Laboratory 
Manual, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et 
20 aZ., in: Monoclonal Antibodies and T-Cell Hybridomas 563-681 (Elsevier, N.Y,, 
1981) (said references incorporated by reference in their entireties). The term 
"monoclonal antibody*' as used herein is not limited to antibodies produced 
through hybridoma technology. The term "monoclonal antibody^' refers to an 
antibody that is derived from a single clone, including any eukaryotic, 
25 prokaryotic, or phage clone, and not the method by which it is produced. 

Examples of phage display methods that can be used to make the 
antibodies of the present invention include those disclosed in W097/13844; and 
U.S. Patent Nos. 5,580,717, 5,821,047, 5,571,698, 5,780,225, and 5,969,108; 
each of which is incorporated herein by reference in its entirety. 
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As described in the above references, affcer phage selection, the antibody 
coding regions from the phage can be isolated and used to generate whole 
antibodies, including hvunan antibodies, or any other desired antigen binding 
fragment, and expressed in any desired host, including mammahan cells, 
5 insect cells, plant cells, yeast, and bacteria, e.g., as described below. 

Techniques to recombinantly produce Fab, Fab' and F(ab')2 fragments can also 
be employed using methods known in the art such as those disclosed in PCT 
publication No. WO 92/22324; MulHnax et al, 1992, BioTechniques 12(6):864- 
869 and Better et aL, 1988, Science 240:1041-1043 (said references 

10 incorporated by reference in their entireties). 

It is also possible to produce therapeutically useful IgG, IgA, IgM and 
IgE antibodies. For an overview of the technology for producing human 
antibodies, see Lonberg and Huszar (1995, InL Rev. Immunol. 13:65-93). For a 
detailed discussion of the technology for producing human antibodies and 

15 human monoclonal antibodies and protocols for producing such antibodies, see, 
e.g., PCT publication No. WO 98/24893, which is incorporated by reference 
herein in its entirety. In addition, companies such as Medarex, Inc. (Princeton, 
NJ), Abgenix, Inc. (Freemont, OA) and Genpharm (San Jose, CA) can be 
engaged to provide human antibodies directed against a selected antigen using 

20 technology similar to that described above. 

Recombinant expression used to produce the antibodies, derivatives or 
analogs thereof (e.g., a heavy or hght chain of an antibody of the invention or a 
portion thereof or a single chain antibody of the invention), requires 
construction of an e:^ression vector containing a polynucleotide that encodes 

25 the antibody and the expression of said vector in a suitable host cell or even in 
vivo,Once a polynucl eotide encoding an antibody molecule or a heavy or hght 
chain of an antibody, or portion thereof (preferably, but not necessarily, 
containing the heavy or Ught chain variable domain), of the invention has been 
obtained, the vector for the production of the antibody molecule may be 

30 produced by recombinant DNA technology using techniques well known in the 
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art. Thus, methods for preparing a protein by expressing a polynucleotide 
containing an antibody encoding nucleotide sequence are described herein. 
Methods which are well known to those skilled in the art can be used to 
construct expression vectors containing antibody coding sequences and 
5 appropriate transcriptional and translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombination. The invention, thus, provides 
replicable vectors comprising a nucleotide sequence encoding an antibody 
molecule of the invention, a heavy or Ught chain of an antibody, a heavy or 

10 light chain variable domain of an antibody or a portion thereof, or a heavy or 
light chain CDR, operably linked to a promoter. Such vectors may include the 
nucleotide sequence encoding the constant region of the antibody molecule 
(see, e.g., PCT Pubhcation WO 86/05807; PCT PubHcation WO 89/01036; and 
U.S. Patent No. 5,122,464) and the variable domain of the antibody may be 

.15 cloned into such a vector for expression of the entire heavy, the entire Hght 
chain, or both the entire heavy and Ught chains. 

The expression vector is transferred to a host cell by conventional 
techniques and the transfected ceUs are then cultured by conventional 
techniques to produce an antibody of the invention. Thus, the invention 

20 includes host cells containing a polynucleotide encoding an antibody of the 
invention or fragments thereof, or a heavy or light chain thereof, or portion 
thereof, or a single chain antibody of the invention, operably linked to a 
heterologous promoter. In preferred embodiments for the expression of double- 
chained antibodies, vectors encoding both the heavy and hght chains may be 

25 co-expressed in the host cell for expression of the entire immunoglobulin 
molecule, as detailed below. 

A variety of host-expression vector systems may be utihzed to' express 
the antibody molecules of the invention (see above) 

In mammalian host cells, a number of vireil-based expression systems 

30 may be utihzed. In cases where an adenovirus is used as an expression vector. 
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the antibody coding sequence of interest may be ligated to an adenovirus 
transcription/translation control complex, e,g., the late promoter and tripartite 
leader sequence. This chimeric gene may then be inserted in the adenovirus 
genome by in vitro or in vivo recombination. Insertion in a non-essential region 
5 of the vixal genome (e.g.^ region El or E3) will result in a recombinant virus 

that is viable and capable of expressing the antibody molecule in infected hosts 
ie.g., see Logan & Shenk, 1984, Proa Natl. Acad. Sci. USA 81:355-359). 
Specific initiation signals may also be required for efficient translation of 
inserted antibody coding sequences. These signals include the ATG initiation 

10 codon and adjacent sequences. Furthermore, the initiation codon must be in 
phase with the reading frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous translational control signals 
and initiation codons can be of a variety of origins, both nattiral and synthetic. 
The eflBiciency of expression may be enhanced by the inclusion of appropriate 

15 transcription enhancer elements, transcription terminators, etc. (see, e.g., 
Bittner et al., 1987, Methods ia Enzymol. 153:516-544). 

Once an antibody molecule to be used with the methods of the invention 
has been produced by recombinant expression, it may be purified by any 
method known in the art for purification of an immunoglobulin molecule, for 

20 example, by chromatography (e.g,, ion exchange, affinity, particvilarly by 
affinity for the specific antigen after Protein A, and sizing column 
chromatography), centrifugation, differential solubility, or by any other 
standard technique for the purification of proteins. Further, the antibodies of 
the present invention or firagments thereof may be fused to heterologous 

25 polypeptide sequences described herein or otherwise known in the art to 
facilitate purification. 

As stated above, according to a farther aspect, the invention provides an 
antibody as defined above for use in therapy. 

For therapeutic treatment, antibodies may be produced in vitro and 

30 applied to the subject in need thereof. The antibodies may be administered to a 
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subject by any suitable route, preferably in the form of a pharmaceutical 
composition adapted to such a route and in a dosage which is effective for the 
intended treatment. Therapeutically effective dosages of the antibodies 
required for decreasing the rate of progress of the disease or for eliminating 
5 the disease condition can easily be determined by the skilled person. 

Alternatively, antibodies may be produced by the subject itself by using 
in vivo antibody production methodologies as described above. Suitably, the 
vector used for such in vivo production is a viral vector, preferably a viral 
vector with a target cell selectivity for specific cancer cells. 

10 Therefore, according to a still further aspect, the invention provides the 

use of an antibody as defined above in the manufacture of a medicament for 
use in the treatment of a mammal to achieve the said therapeutic effect. The 
treatment comprises the adininistration of the medicament in a dose sufficient 
to achieve the desired therapeutic effect. The treatment may comprise the 

15 repeated administration of the antibody. 

According to a still further aspect, the invention provides a method of 
treatment of a human comprising the administration of an antibody as defined 
above in a dose sufficient to achieve the desired therapeutic effect. The 
therapeutic effect being the alleviation or prevention of various types of 

20 cancers such as bone cancers, brain timiors, breast cancer, endocrine system 
cancers, gastrointestinal cancers, gynaecologic cancers, head and neck cancers, 
lexikemia, Ixmg cancers, lymphomas, metastases, myelomas, pediatric cancers, 
penile cancer, prostate cancer, sarcomas, skin cancers, testicular cancer, 
thyroid cancer, urinary tract cancers, more in particular the remission or 

25 prevention of relapse of soUd tumors of bladder, cervix, lung, colon, breast, 
prostate, ovary, pancreas, hver, testicle, uterus, bone, oral cavity and 
oropharynx tissue, preferable of leukemia, such as acute myeloid leukemia 
(AML), acute lymphocytic lexikemia (ALL), chronic myelogenous leukemia 
(CML) and chronic lymphocytic lexikemia (CLL), most preferably of AML. 
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The diagnostic and therapeutic antibodies are preferably used in their 
respective application for the targeting of kinases or phosphatases, which are 
often coupled to receptor molecules on the cell's surface. As such, antibodies 
capable of binding to these receptor molecules can exert their activity- 
5 modixlating effect on the kinases or phosphatases by binding to the respective 
receptors. Also transporter proteins may be targeted with advantage for the 
same reason that the antibodies will be able to exert their activity-modulating 
effect when present extraceUularly. The above targets, together with signaling 
molecules, represent preferred targets for the antibody uses of the invention as 

10 more effective therapy and easier diagnosis is possibly thereby. 

The diagnostic antibodies can suitably be used for the qualitative and 
quantitative detection of gene products, preferably proteins in assays for the 
determination of altered levels of proteins or structural changes therein. 
Protein levels xnay for instance be determined in cells, in ceU extracts, in 

15 supematants, body fluids by for instance flow-c3rtometric evaluation of 

immunostained cancer cells. Alternatively, quantitative protein assays such as 
ELISA or RIA, Western blotting, and imaging technology (e.g., using confocal 
laser scanning microscopy) may be used in concert with the antibodies as 
described herein for the diagnosis of cancers, preferably letikemia. 

20 

Use of Polynucleotides to Construct Arrays for Diagnostics 

A polynucleotide capable of hybridising to a murine genomic region of 
the present invention as weU as sequences and gene products thereof is useful 
for determirdng the occurrence of cancer, tumor progression, hyperproliferative 

25 growth, and/or accompanying biological or physical manifestations. 

Specifically, the polynucleotides representing the murine genomic regions and 
genes affected by transformations therein and encoded polypeptides can be 
utihzed to determine the occurrence of le\akemia, such as AML. 
To determine the occurrence of cancer, tximor progression, 

30 hyperproliferative growth, and/or accompanying biological or physical 
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manifestations, the levels of polsmucleotides and/or encoded polypeptides of the 
present invention in a sample are compared to the levels in a normal control of 
body tissues, cells, organs, or fluids. The normal control can include a pool of 
cells from a particular organ or tissue or tissues and/or cells from throughout 
5 the body. For such measurements either immunoassays or nucleic acid assays 
as known in the art can be used. 

Any observed diflference between the sample and normal control can 
indicate the occurrence of disease or disorder. Typically, if the levels of the 
polynucleotides and the encoded polj^eptides of the present invention are 
10 higher than those found in the normal control, the results indicate the 

occurrence of cancer, tumor progression, hyperproliferative growth, and/or 
accompanying biological or physical manifestations. 

In addition, the present murine genomic regions can be useful to 
diagnose the severity as well as the occurrence of cancer, tumor progression, 
15 hyperproliferative growth, and/or accompanying biological or physical 

manifestations. For example, the greater the difference observed in the sample 
versus the normal control of the expression products of the genes comprised in 
said genomic regions or affected by transformations therein, the greater the 
severity of the disorder, in particular, when higher levels as compared to a 
20 normal control are observed. 

Polynucleotide or ohgonucleotide arrays provide a high throughput 
technique that can assay a large number of polynucleotide sequences in a 
sample. This technology can be used as a diagnostic and as a tool to test for 
differential expression to determine function of an encoded protein. 
25 Polynucleotide or oligonucleotide arrays constitute a specific embodiment of a 
diagnostic composition according to the present invention. 

To create arrays, polynucleotide or oligonucleotide probes are spotted 
onto a substrate in a two-dimensional matrix or array. Samples of 
polynucleotides can be labeled and then hybridized to the probes. Double 
30 stranded polynucleotides, comprising the labeled sample polynucleotides 
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bound to probe polynucleotides or oligonucleotide, can be detected once the 
unbound portion of the sample is washed away. 

The probe polynucleotides or oHgonucleotide can be spotted on 
substrates including glass, nitrocellulose, etc. The probes can be bound to the 
5 substrate by either covalent bonds or by non-specific interactions, such as 
hydrophobic interactions. The sample polynucleotides can be labeled using 
radioactive labels, fluorophors, etc. 

Techniques for constructing arrays and methods of using these arrays 
are described in EP No. 0 799 897; PCT No. WO 97/29212; POT No. WO 
10 97/27317; EP No. 0 785 280; PCT No. WO 97/02357; U.S. Pat. No. 5,593,839; 
U.S. Pat. No. 5,578,832; EP No. 0 728 520; U.S. Pat. No. 5,599,695; EP No. 0 
721 016; U.S. Pat. No. 5,556,752; PCT No. WO 95/22058; and U.S. Pat. No. 
5,631,734. 

Further, arrays can be used to examine differential expression of genes 
15 and can be used to determine gene function. For example, arrays of the instant 
polynucleotide sequences of the murine genomic regions and of genes affected 
by transformations therein can be used to determine if any of the sequences 
are differentially expressed between normal cells and cancer cells, for example. 
High expression of a particular message in a cancer cell, which is not observed 
20 in a corresponding normal cell, can indicate a cancer specific protein. 

Differential Expression 

The present invention also provides a method to identify abnormal or 
diseased tissue in a human. The expression of a gene corresponding to a 

25 specific polynucleotide is compared between a first tissue that is suspected of 
being diseased and a second, normal tissue of the human. The normal tissue is 
any tissue of the human, especially those that express the murine genomic 
region-related gene including, but not limited to, blood cells, brain, thymus, 
testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 

30 pancreas, and the mucosal lining of the colon. 
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The expression products of mtirine genomic region-related genes in the 
two tissues are compared by any means known in the art* 

Murine genomic region -related mRNA in the two tissues is compared. 
PolyA RNA is isolated from the two tissues as is known in the art. For 
example, one of skill in the art can readily determine differences in the size or 
amoxmt of polynucleotide-related mRNA transcripts between the two tissues 
using Northern blots and nucleotide probes. Increased or decreased expression 
of an poljmucleotide-related mRNA in a tissue sample suspected of being 
diseased, compared with the expression of the same polynucleotide-related 
mRNA in a normal tissue, indicates manifestation of the disease. 

Screening for Peptide Analogs and Antagonists 

Polypeptide expression products of mxirine genomic region-related genes 
as well as corresponding full length genes can be used to screen peptide 
Hbraries to identify binding partners, such as receptors, from among the 
encoded polypeptides. 

Such binding partners can be useful in treating cancer, tumor 
progression, hsTperproUferative ceU growth, and/or accompanying biological or 
physical manifestations. For example, peptides or other compounds that are 
capable of binding or interacting with membrane -bound polypeptides encoded 
by one or more of the murine genomic regions listed in Table 1, can be useful 
as a therapeutic. Also, peptides or other compounds capable of altering the 
conformation of any of the encoded polypeptides by one or more of the murine 
genomic regions listed in Table 1 can inhibit biological activity and be useful 
as a therapeutic. 

A library of peptides may be S3nithesized following the methods 
disclosed in U.S. Pat. No. 5,010,175, and in PCT W091/17823. 

Peptide agonists or antagonists are screened using any available 
method, such as signal transduction, antibody binding, receptor binding, 
mitogenic assays, chemotaxis assays, etc. The methods described herein are 
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presently preferred. The assay conditions ideally shoxild resemble the 
conditions under which the native activity is exhibited in vivo, that is, tinder 
physiologic pH, temperature, and ionic strength. Stiitable agonists or 
antagonists will exhibit strong inhibition or enhancement of the native activity 
5 at concentrations that do not cause toxic side effects in the subject. Agonists or 
antagonists that compete for binding to the native polypeptide may require 
concentrations equal to or greater than the native concentration, while 
inhibitors capable of binding irreversibly to the polypeptide may be added in 
concentrations on the order of the native concentration. 
10 The end results of such screening and experimentation will be at least 

one novel polypeptide binding partner, such as a receptor, encoded by a murine 
genomic region-related gene of the invention, and at least one peptide agonist 
or antagonist of the novel binding partner. Such agonists and antagonists can 
be used to modulate receptor function in cells to which the receptor is native, 
15 or in cells that possess the receptor as a result of genetic engineering. Further, 
if the novel receptor shares biologically important characteristics with a known 
receptor, information about agojodst/antagonist binding may help in developing 
improved agonists/antagonists of the known receptor. 

Therapeutics, whether polynucleotide or polypeptide or small molectde, 
20 can be tested, for example, in the mouse tumor assay described in Pei et aL, 
MoL Endo. 11: 433-441 (1997). 

Other models for testing polynucleotides, polypeptides, antibodies, or 
small molecules useful for treatment include: animal models and cell lines 
disclosed in Bosland, Encyclopedia of Cancer, Volume II, pages 1283 to 1296 
25 (1997) by Academic Press. Other useful cell lines are described in Brothman, 
Encyclopedia of Cancer, Volume II, pages 1303 to 1313 (1997) by Academic 
Press 

Pharmaceutical Compositions and Therapeutic Uses 
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Pharmaceutical compositions can comprise polypeptides, antibodies, 
polynucleotides (antisense, RNAi, ribozyme), or small molecules of the claimed 
invention, collectively called inhibitor compounds herein. The pharmaceutical 
compositions will comprise a therapeutically effective amount of either 
5 polypeptides, antibodies, polynucleotides or small molecules of the claimed 
invention. 

The term "therapeutically effective amount" as used herein refers to an 
amount of a therapeutic agent to treat, ameUorate, or prevent a desired 
disease or condition, or to exhibit a detectable therapeutic or preventative 

10 effect. The effect can be detected by, for example, chemical markers or antigen 
levels. Therapeutic effects also include reduction in physical symptoms, such 
as decreased body temperature. The precise effective amount for a subject will 
depend upon the subject's size and health, the nature and extent of the 
condition, and the therapeutics or combination of therapeutics selected for 

15 administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be 
determined by routine experimentation and is within the judgment of the 
clinician. Specifically, the compositions of the present invention can be used to 
treat, amehorate, or prevent cancer, tumor progression, hyperproliferative cell 

20 growth and/or accompanying biological or physical manifestations, including 
leukaemia. 

For purposes of the present invention, an effective dose will be from 
about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the 
polynucleotide, polypeptide or antibody compositions in the individual to which 
25 it is administered. 

A pharmaceutical composition can also contain a pharmaceutically 
acceptable carrier. The term "pharmaceutically acceptable carrier" refers to a 
carrier for administration of a therapeutic agent, such as antibodies or a 
polypeptide, genes, and other therapeutic agents. The term refers to any 
30 pharmaceutical carrier that does not itself induce the production of antibodies 
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harmful to the individual receiving the composition, and which may be 
administered without imdue toxicity. Suitable carriers may be large, slowly 
metaboHzed macromolecules such as proteins, polysaccharides, polylactic 
acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, and 
5 inactive virus particles. Such carriers are well known to those of ordinary skill 
in the art. 

Pharmaceutically acceptable salts can be used therein, for example, 
mineral acid salts such as hydrochlorides, hydrobromides, phosphates, 
sulfates, and the like; and the salts of organic acids such as acetates, 

10 propionates, malonates, benzoates, and the like. A thorough discussion of 
pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may 
contain liquids such as water, saline, glycerol and ethanoL Additionally, 

15 auxiUary substances, such as wetting or emulsifying agents, pH bufiGBring 
substances, and the like, may be present in such vehicles. Typically, the 
therapeutic compositions are prepared as injectables, either as hquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection may also be prepared. Liposomes are included within 

20 the definition of a pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the pharmaceutical compositions of the invention can 
be (1) administered directly to the subject; (2) delivered ex vivo, to cells derived 
25 from the subject; or (3) dehvered in vitro for e3q)ression of recombinant 
proteins. 

Direct delivery of the compositions will generally be accomplished by 
injection, either subcutaneously, intraperitoneally, intravenously or 
intramusciilarly, or delivered to the interstitial space of a tissue. The 
30 compositions can also be administered into a tumor or lesion. Other modes of 
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administration include topical, oral, catheterized and pulmonary 
administration, suppositories, and transdermal applications, needles, and 
particle gunsor hyposprays. Dosage treatment may be a single dose schedxxle 
or a multiple dose schedule. 
5 Methods for the ex vivo delivery and reimplantation of transformed cells 

into a subject are known in the art and described in e.g., International 
Publication No. WO 93/14778. Examples of cells useful in ex vivo applications 
include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

10 Generally, dehvery of nucleic acids for both ex vivo and in vitro 

appHcations can be accompHshed by, for example, dextran*-mediated 
transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into 

15 nuclei, all weU known in the art. 

Various methods are used to administer the therapeutic composition 
directly to a specific site in the body. For example, a small metastatic lesion is 
located and the therapeutic composition injected several times in several 
different locations within the body of tumor. Alternatively, arteries which 

20 serve a tumor are identified, and the therapeutic composition injected into 
such an artery, in order to dehver the composition directly into the tumor. A 
tumor that has a necrotic center is aspirated and the composition injected 
directly into the now empty center of the tumor. The antisense composition is 
directly administered to the surface of the tumor, for example, by topical 

25 apphcation of the composition. X-ray imaging is used to assist in certain of the 
above dehvery methods. 

Receptor-mediated targeted dehvery of therapeutic compositions 
containing an antisense polynucleotide, sub genomic polynucleotides, or 
antibodies to specific tissues is also used. Receptor-mediated DNA dehvery 

30 techniques are described in, for example, Findeis et al.. Trends in Biotechnol. 
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(1993) 11:202-205; Chiou et al., (1994) Gene Therapeutics: Methods And 
Applications Of Direct Gene Transfer (J. A. Wolff, ed.); Wu et al., J. Biol. 
Chem. (1994) 269:542-46. Preferably, receptor-mediated targeted delivery of 
therapeutic compositions containing antibodies of the invention is used to 
5 deliver the antibodies to specific tissue. 

Pharmaceutical compositions containing antisense, ribozsrme or RNAi 
polynucleotides are administered in a range of about 100 ng to about 200 mg of 
polynucleotides for local administration in a gene therapy protocol. 
Concentration ranges of about 500 ng to about 50 mg, about 1 p.g to about 2 

10 mg, about 5 \ig to about 500 ^ig, and about 20 fig to about 100 fig of 

polynucleotides can also be used during a gene therapy protocol. Factors such 
as method of action and efficacy of transformation and expression are 
considerations which will affect the dosage required for ultimate efficacy of the 
poljniucleotides- Where greater expression is desired over a larger area of 

15 tissue, larger amoimts of polynucleotides or the seime amoxmts readministered 
in a successive protocol of adniinistrations, or several administrations to 
difiEerent adjacent or close tissue portions of, for example, a tumor site, may be 
required to effect a positive therapeutic outcome. In all cases, routine 
experimentation in clinical trials will determine specific ranges for optimal 

20 therapeutic effect. A more complete description of gene therapy vectors, 

especially retroviral vectors, is contained in U.S. Ser. No. 08/869,309, which is 
expressly incorporated herein. 

Development of specific therapeutic methods 

25 As iQustrated by the case wherein a specific translocation, e.g. the 

t(15,17) translocation as described above, can be assigned to a specific 
mutation and to a dysfunctional gene, the identity and more specifically the 
functionahty of that gene may aid significantly in the development of methods 
of treatment. The functionahty of a gene may be determined by methods 

30 known in the art, for instance by structm'e-function analysis. 
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Genes that are shown to be both differentially expressed and 
functionally important for human tumors are selected for structure-function 
analysis. Deletion and point mutation mutants of these genes are constructed 
and tested for their functional competence compared with wild-type genes 
5 (according to Ibanez et ah Structural and functional domains of the myb 
oncogene: requirements for nuclear transport, myeloid transformation, and 
colony formation. J Virol 62:1981-8, 1988; Rebay et al. Specific truncations of 
Drosophila Notch define dominant activated and dominant negative forms of 
the receptor. Cell 74:319-29, 1993). These studies lead to the identification of 

10 functionally critical domains as well as to dominant-negative variants, i.e. 

mutant genes that suppress the function of their endogenous counterparts (e.g. 
Kashles et al. A dominant negative mutation suppresses the function of normal 
epidermal growth factor receptors by heterodimerization. Mol Cell Biol 
11:1454-63, 1991). These dominant-negatives provide additional proof for the 

15 functional importance of the genes and give insight into which therapeutic 

approaches can be pursued to interfere with their function or that can support 
their impaired function. 

Development of an inhibitor compound 

20 The present invention also provides methods for the development and 
production of inhibitor compounds capable of inhibiting the genes or 
expression products disclosed in the present invention that play a role in the 
development of cancer. Essentially, such methods are well known in the art 
and generally comprise a number of consecutive steps starting with the 

25 identification of genes involved in cancer, in particular as disclosed in 

Examples 2 and 3 below, by using retroviral insertional tagging, optionally in 
a specific genetic background; 

b) validation of one or more of the identified genes as potential target gene(s) 
for the inhibitor compound by one or more of the following methods: 
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- coiifirinatioii of the identified gene by Northern Blot analysis in cancer cell- 
lines; 

- determination of the expression profile of the identified gene in tumors and 
normal tissue; 

5 - determination of the functional importance of the identified genes for cancer; 
c) production of the expression product of the gene; and 

d) use of the expression product of the gene for the production or design 
of an inhibitor compound. 

10 Functional importance of the identified genes for human cancer 

The functional importance of the differentially expressed genes for 
tumor cells is determined by over-expression as well as by inhibition of the 
expression of these genes. Selected human tumor ceU lines are transfected 
either with plasmids encoding cDNA of the genes or with plasmids encoding 

15 RNA interference probes for the genes. RNA interference is a recently 
developed technique that involves introduction of double-stranded 
oHgonucleotides designed to block expression of a specific gene (see Elbashir et 
al Nature 411:494-8, 2001; Brummelkamp et ah Science 296:550-3, 2002). 
Using standard laboratory techniques and assays, the transfected cell lines are 

20 extensively checked for altered phenotjnpes that are relevant for the tumor 

cells, e.g. cell cycle status, proliferation, adhesion, apoptosis, invasive abihties, 
etc. These experiments demonstrate the functional importance of the identified 
genes for human cancer. 

25 Gene Therapy 

The therapeutic polynucleotides and polypeptides of the present 
invention may be utilized in gene delivery vehicles. The gene delivery vehicle 
may be of viral or non- viral origin (see generally, Jolly, Cancer Gene Therapy 
(1994) 1:51-64; Kimura, Human Gene Therapy (1994) 5:845-852; ConneUy, 

30 Human Gene Therapy (1995) 1:185-193; and Kaphtt, Nature Genetics (1994) 
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6:148-153), Gene therapy vehicles for dehvery of constructs including a coding 
sequence of a therapeutic of the invention can be administered either locally or 
systemicaUy. These constructs can utilize viral or non-viral vector approaches. 
Expression of such coding sequences can be induced using endogenous 
5 mammahan or heterologous promoters. Expression of the coding sequence can 
be either constitutive or regidated. 

The present invention can employ recombinant retroviruses which are 
constructed to carry or e:»press a selected nucleic acid molecule of interest. 
Retrovirus vectors that can be employed include those described in EP 0 415 
10 731; WO 94/03622; WO 93/25698; WO 93/25234; U.S. Pat. No. 5, 219,740; and 
EP 0 345 242. Preferred recombinant retroviruses include those described in 
WO 91/02805. 

Packaging cell Hues suitable for use with the above-described retroviral 
vector constructs may be readily prepared (see PCT pubHcations WO 95/30763 

15 and WO 92/05266), and used to create producer ceU Hues (also termed vector 
cell lines) for the production of recombinant vector particles. Within 
particularly preferred embodiments of the invention, packaging cell Hnes are 
made from human (such as HT1080 cells) or mink parent cell Hnes, thereby 
allowing production of recombinant retroviruses that can survive inactivation 

20 in human serum. 

The present invention also employs alphavir\xs-hased vectors that can 
function as gene delivery vehicles. Such vectors can be constructed from a wide 
variety of alphaviruses, including, for example, Sindbis virus vectors, SemKki 
forest virus (ATCC VR-67; ATCC VR-1247), Ross River virus (ATCC VR-373; 

25 ATCC VR-1246) and Venezuelan equine encephalitis virus (ATCC VR-923: 
ATCC VIl-1250; ATCC VR 1249; ATCC VR-532). Representative examples of 
such vector systems include those described in U.S. Pat. Nos. 5,091,309; 
5,217,879; and 5,185,440; and PCT PubUcation Nos. WO 92/10578; WO 
94/21792; WO 95/27069; WO 95/27044; and WO 95/07994. 
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Gene delivery vehicles of the present invention can also employ 
parvovirus such as adeno-associated virus (AAV) vectors. Representative 
examples include the AAV vectors disclosed by Srivastava in WO 93/09239, 
Samulski et al., J. Vir. (1989) 63:3822-8828; Mendelsonet al., Virol. (1988) 
5 166:154-165; and Flotte et aL. PNAS (1993) 90:10613-10617. * 

Representative examples of adenoviral vectors include those described 
by Berkner, Biotechniques (1988) 6:616-627; Rosenfeld et al.. Science (1991) 
252:431-434; WO 93/19191; KoUs et al., PNAS (1994) 91:215-219; Kass-Eisler 
et al., PNAS (1993) 90:11498-11502; Guzman et al., Circulation (1993) 

10 88:2838-2848; Guzman et al., Cir. Res. (1993) 73:1202-1207; Zabner et al., Cell 
(1993) 75:207-216; Li et al., Hum. Gene Ther. (1993) 4:403-409; CaHaud et al., 
Eur. J. Neurosci. (1993) 5:1287-1291; Vincent et al., Nat. Genet. (1993) 5:130- 
134; Jaffe et al., Nat. Genet. (1992) 1:372-378; and Levrero et al.. Gene (1991) 
101:195-202. Exemplary adenoviral gene therapy vectors employable in this 

15 invention also include those described in WO 94/12649, WO 93/03769; WO 
93/19191; WO 94/28938; WO 95/11984 and WO 95/00655. Administration of 
DNA link ed to killed adenovirus as described in Ciiriel, Hum. Gene Ther. 
(1992) 3:147-154 may be employed. 

Other gene delivery vehicles smd methods may be employed, including 

20 polycationic condensed DNA linked or unlinked to killed adenovirus alone, for 
example Ctiriel, Hum. Gene Ther. (1992) 3:147-154; Hgand linked DNA, for 
example see Wu, J. Biol. Chem. (1989) 264:16985-16987; eukaryotic cell 
delivery vehicles cells, for example see U.S. Ser. No. 08/240,030, filed May 9, 
1994, and U.S. Ser. No. 08/404,796; deposition of photopolymerizedhydrogel 

25 materials; hand-held gene transfer particle gun, as described in U.S. Pat. No. 
5,149,655; ionizing radiation as described in U.S. Pat. No. 5,206,152 and in 
WO92/11033; nucleic charge neutralization or fusion with cell membranes. 
Additional approaches are described in Phihp, Mol. Cell Biol. (1994) 14:2411- 
2418, andin Woffendin, Proc. Natl. Acad. Sci. (1994) 91:1581-1585. 
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Naked DNA may also be employed. Exemplary naked DNA introduction 
methods are described in WO 90/11092 and U.S. Pat, No. 5,580,859. 

Fvirther non- viral delivery suitable for use includes mechanical delivery 
systems such as the approach described in Woffendin et al., Proc. NatL Acad. 
5 Sci. USA (1994) 91(24):11581-11585. 

The surprisingly intricate relationship between the gene sequences as 
disclosed herein and the development of mahgnancy as unraveled by the 
inventors signifies the presence of novel routes of cancer development. 

Due to this finding the identification of additional risk factors, novel 
10 therapeutic interventions and pharmaceuticals and the treatment and 
prophylaxis of cancers, and in particular of leukemia has now become 
available, which will ultimately result in a reduction in occxirrence and/or 
progression of this disease as well as unproved metrhods for diagnosis and 
prognosis. 

15 The present invention now also provides for a method of treatment 

comprising modulating the expression of gene sequences as described herein. 

A further object of the present invention is to provide for coordinated 
design and discovery of new drugs for the treatment of cancers, and in 
particular of leukaemia, and related diseases as well as providing compositions 

20 comprising modulators of such genes or their expression products which can 
serve as a basis or an ingredient of a pharmaceutical composition. The present 
invention therefore also relates to pharmaceutical products that comprise such 
modulating compositions. As a further object, the present invention provides 
the use of at least one modulator for the manufactiire of a medicament for the 

25 treatment and/or prevention of cancers, and in psurticular of leukemia, and/or 
related disease. 

The present invention will now be illustrated by way of the following 
non limiting examples. 



30 



EXAMPLE 1 
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Identification of common viral insertion (CIS^ 

INTRODUCTION 

To identify common viral integration sites in mouse tumors and tumor 
5 derived cell lines, the MuLV virus was used. Mice were either infected with 
murine leukemia virus 1.4 (Graf&-1.4 MuLV) or with Cas-Br-M MuLV. The 
Grafi&-MuLV is an ecotropic retroviral complex causing leukemias in mice. 
This viral complex does not contain oncogenic sequences itself hut rather 
deregtdates genes due to proviral integrations. Graffi-1.4 MuLV is a subclone 
.0 of this complex and predominantly induces myeloid leukemias. NIH/Swiss 

mice infected with Cas-Br-M MuLV develop myeloid or lymphoid maUgnancies 
also as a result of retroviral insertion that affect target genes. 

MATERIALS AND METHODS 

15 1. Induction of leukemias 

Newborn FVB/N mice or NIH/Swiss were injected subcutaneously with 
100 jil of a cell culture supernatant of Graffi-1,4 MuLV or Cas-Br-M MuLV 
producing NIH3T3 cells, respectively. Mice were checked daily for symptoms of 
illness, i.e., apathy, white ears and tail, impaired interaction with cage-mates, 

20 weight-loss, and dull fur. TsrpicaUy, leukemic mice suffered from enlarged 
spleens, hvers, thymuses, and lymph nodes. From these primary tumors, 
chromosomal DNA was isolated for PCR-based screening. Blood samples were 
taken from the heart. For morphological analysis, blood smears and (grtospins 
were fixed in methanol, May-Griinwald-Giemsa (MGG) stained and analyzed. 

25 Single-ceU suspensions of different organs were analyzed by flow 

cytometry using a flow cytometer. The cells were labeled with the following rat 
monoclonal antibodies: ER-MP54 (ER-MP54), ER-MP58 (ER.MP58), Ml/70 
(Mac-1), F4/80 (F4/80), RB68C5 (GR-1), ER-MP21 (transferrin receptor), 
TER119 (Glycophorin A), 59-AD2.2 (Thy-1), KT3 (CD3), RA3 6B2 (B220) and 
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E13 161-7(Scal). Immunodetection was performed utilizing a Goat-anti-Rat 
antibody coupled to fluorescein isothiocyanate (G„Ra-Fitc) 

2. Inverse PGR on Graffi-1.4 MuLV induced leukemias 

Genomic DNA from the primary tumors was digested with Hhal. After 
ligation, a first PGR was performed using Graf&-1.4 MuLV (LTR) specific 
primers LI (ff TGGAAGATGGCGTTAGTGTAGGTAG 3') (SEQ ID NO: 1) and 
L2 (5' CCAGGTTGCCGCAAAGAGCTG 3') (SEQ ID NO: 2) (cycling conditions 
were 1 min at 94°G, 1 min at 65**G, 3 min at 72**G [30 cycles]). For the second 
nested PGR the primers LIN (5' AGCCTTATGGTGGGGTGTTTG 3') (SEQ ID 
NO: 3) and L2N (5' AAAGACCTGAAACGACCTTGC 3') (SEQ ID NO: 4) (15 
cycles) were used. The PGR reaction mixtirre contained 10 mM Tris-HCl (pH 
8.3), 50 mM KGL, 1.5 mM MgCk, 200 ^iM dNTFs, 10 pmol of each primer and 
2.5. units of Taq-polymerase. The PGR fragments were analyzed on a 1% 
agarose gel and cloned in a TA cloning vector according to standard 
procedures. PGR products were sequenced with the M13 forward primer 5' 
GAGCGGGAGGAAAATG 3» (SEQ ID NO: 5) and M13 reverse primer 5' 
CAGGAAACAGCTATGAC 3' (SEQ ID NO: 6). Virus flanking genomic 
sequences were identified using the National Genter for Biotechnology 
Information (NGBI) and Gelera databases 

3. Inverse PGR and real-time PGR on Gas-Br-M MuLV leukemias 

Five \ig of genomic DNA was digested with Sau3A or with SstL The 
products were treated with T4-Ugase, which restdted in the formation of 
circularized products. Subsequently, an inverse PGR (ICPR) strategy was used 
with primers specific for the Gas-Br-M MuLV LTR. For the Sau3A 
digested/hgated fragments, the first PGR reaction was carried out with 
primers pLTR4 (5'-CGG AAA GTG GTT AGG ACA- 3') (SEQ ID NO: 7) and 
pLTR3 (5'-GGT GTG GTG AGA GTG ATT-30 (SEQ ID NO: 8), foUowed by a 
nested PGR using pLTR5 (5^AGG AGA GAT ATG GTG TTT-30 (SEQ ID NO: 9) 
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and pLTR6 (5'-GTG ATT GAG TAG GGG GTT-3') (SEQ ID NO: 10). Cycle 
conditions for both PCRs were 15" at 94°C, 30" at 57«'C, and 2' at 72°C for 10 
cycles, and an additional 20 cycles following the conditions 15" at 94°C, 30" at 
57°C, and 2'30" at 72°C. Reactions were performed using Expand High Fidelity 
5 PGR System. In case of SstI digested genomic DNA, the drcularized DNA was 
amplified using primers pLTR9 (5'-GAG TCA GTG TAT GGG AGG AG-3') 
(SEQ ID NO: 11) and pLTRl (5'-GTT GGT GTT GGA TGG GAG TGG-SO (SEQ 
ID NO: 12), and pLTRlO (5'-GTG AGG GGT TGT GTG GTC-3') (SEQ ID NO: 
13) and pLTR2 (5'-GTG TGG CTG TTG GTT GGG AGG-3') (SEQ ID NO: 14), 

10 respectively. The first PGR was performed for 30 cycles 30" at 94"C, 1' at 60°G, 
and 3' at 72°C. The reaction was carried out with E3q)and High Fidehty PGR 
system. Nested PGR conditions were 30 cycles of 30" at 94°C, 1' at dS'C, and 1' 
at 72°G. This reaction was performed with Taq polsonerase. 

For RT-PGR, total RNA was isolated throi^h a GsGl gradient. First 

15 strand cDNA was obtained by reverse transcriptase (RT) reactions with an 

oUgo(dT)-adapter primer (5'-GTG GGG AAT TGG TGG AGG GG(dT)i6-3') (SEQ 
ID NO: 15) at 37»G with 5^g RNA, using the Superscript™ PreampUfication 
System. Subsequently, PCRs (1' at 94°C, 1' at 58°C, 3' at 72''G (30 cycles)) were 
performed on the RT reactions of the leukemias by using the LTR specific 

20 primer pLTRG and the adapter primer (5'-GTC GGG AAT TGG TGG AGG CG- 
3') (SEQ ID NO: 15). PGR products were directly cloned into pGR2.1 and 
subjected to nucleotide sequencing with the M13 forward primer 
5'GACGGGCAGGAAAATG 3' (SEQ ID NO: 5) and ml3 reverse primer 
5'CAGGAAACAGGTATGAG 3' (SEQ ID NO: 6). Nucleotide sequences were 

25 compared to the NGBI and Gelera databases for analysis. 



Results 

1. Gra£Ci-1.4 MuLV induced leukemias 

Leukemias developed 4 to 6 months after subcutaneous injection of 
30 newborn FVB/N mice with Gra£&-1.4 MuLV. Forty-eight of the 59 leukemias 
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(81%) analyzed exhibited morphological characteristics of myeloid cells. Blast 
cell percentages in the bone marrow ranged from 24 to 90% with an average of 
48%. Leukemia cells expressed immunophenotjrpic marker profiles consistent 
with their myeloid appearance, e.g., ER-MP54+, ER-MP58+, CD3-, GR-1+. Six 
5 leukemias with blast-hke morphology showed no immimophenotypic 
differentiation markers, suggesting that these tumors represented very 
immature leukemias. Only 3 leukemias were of T-lymphoid origin 
(CD3+/MP58-/Thyl+) and 2 showed mixed myeloid and erjrthroid features 
(Terll9+/ER-MP58+/F4/80+). These results demonstrate that Graf&-1.4 MuLV 

10 infection predominantly induce myeloid leukemia in FVB/N mice. 

The provirus flanks were cloned, subjected to nucleotide sequencing, 
and blasted against the Celera and NCBI databases resulting in the 
identification of common insertion sites. Of the genes identified, the ones that 
were so far not described to be involved in tumor development are Hsted in 

15 Table 1 combined with the novel cancer genes identified firom Cas-Br-M MuLV 
induced leukemias. 

2. Cas-Br-M MuLV induced leukemias 

Cas-Br-M MuLV injected newborn NIH/Swiss mice developed lexikemias 
20 by approximately 140 to 400 days postinoculation. Most of these were myeloid 

levikemias (59%), although T-cell (21%), and mixed T-cell/myeloid (21%) 

leukemias were found. 

To clone viral integration sites, a virus-LTR (long terminal repeat) 

specific inverse PGR as well as RT-PCR were applied as complementary 
25 approaches using DNA or RNA firom 35 myeloid leukemias. The inverse PGR 

method was carried out on 19 primary leukemias and 9 cell lines, whereas the 

RT-PCR based technique was performed on 12 cell lines and 2 primary 

leukemias. 



30 against the Gelera and NCBI databases resulting in the identification of 



The provirus flanks were subjected to nucleotide sequencing, blasted 
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common insertion sites. Of the genes identified, the ones that were so far not 
described to be involved in timior development are listed in Table 1 combined 
with the novel cancer genes identified from Graf&-1.4 MnLV induced 
lenkemias. 

5 

DISCUSSION 

Although proviral integrations occur randomly, they may affect the 
expression or function of nearby genes. If a gene is affected in two or more 
independent tumors, this indicates that these integrations provide a selective 

10 advantage and therefore contribute to tumor development. Multiple of these 
common insertion sites were identified of which a large nimiber are 
demonstrated for the first time to play a role in cancer. Importantly, several of 
the other genes identified are well-known cancer genes validating the 
approach. This example shows that the ptirsued strategy can be successfully 

15 used to identify novel genes that are involved in tumor development. 

EXAMPLE 2 

Definition of novel routes for development of AML 

MATERIAL AND METHODS 
20 GraffH.4 MuLV-induced leukemias 

Newborn mice were injected subcutaneously with 100 jil of cell culture 

supernatant of Graffil.4-MTxLV producing NIH3T3 cells (a gift firom Dr. E. 

Rassart, Department des Siences Biologiques, Universite du Quebec a 

Montreal, Montreal, Quebec, Canada). Mice were treated and analyzed for the 
25 development of leukemia as previously described (Erkeland, et ah, 2003, Blood 

101:1111-7). Chromosomal DNA was isolated from the leukemic cells for PCR- 

based screening (Erkeland, et al, 2003, Blood 101:1111-7). 



Cytological analysis and immunophenotyping of the leukemic cells 
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For morphological analysis, blood smears and cytospins were fixed in 
methanol, stained with May-Griinwald-Giemsa, and examined using a Zeiss 
Axioscope microscope (Carl Zeiss BV, Weesp, The Netherlands). Single-cell 
suspensions of different organs were analyzed by flow cytometiy using a 
5 FACScan flow cytometer (Becton Dickinson aiid Co, MoxintfljSLView, CA, 

USA). Cells were labeled as described previoxisly (Joosten et al, 2000. Virology 
268:308-18) with the following rat monoclonal antibodies: ER-MP54, ER- 
MP58, Ml/70 (Mac-1), F4/80, RB68C5 (GR-1), ER-MP21 (transferrin receptor), 
TER119 (Glycophorin A), 59-AD2.2 (Thy-1), KT3 (CD3), RA3 6B2 (B220) and 
10 E13 161-7 (Seal). Immunodetection was performed using goat-anti-rat 

antibodies coupled to fluorescein isothiocyanate (GaRa-FITC) (Nordic, Tilburg, 
The Netherlands). 



Inverse PGR on Gra£Ei-1.4 MuLV induced leukemias 
15 Genomic DNA firom the primary txmiors was digested with Hhal 

(CGCG). After circularization by ligation (Rapid hgation kit, Roche 
Diagnostics, Mannheim, Germany), a first PGR was performed using Graffi-1.4 
MuLV (LTR) specific primers LI and L2 as described in Example 1. Cycling 
conditions were 1 min at 94°C, 1 min at 62°C and 3 min at 72°C for 30 cycles. 
20 For the second nested PGR, the primers LIN sind L2N as described in Example 
1 (15 cycles) were used. The PGR reaction mixture contained 10 mM Tris-HGl 
(pH 8.3), 50 mM KGL, 1.5 mM MgCk, 200 |jM dNTP's, 10 pmol of each primer 
and 2.5 units of Taq-polymerase (Pharmacia, Uppsala, Sweden). The PGR. 
fragments were analyzed on a 1% agarose gel. 

25 

Detection of virus integrations by specific nested PGR 

To determine the locahzation of the Graffi-1.4 provirus in particular 
virus-targeted genes in an extended panel of lexikemias, a nested PGR was 
performed on DNA from primary tmnors. For the first PGR, virus integration 
30 site (VIS) locus specific primers XI and X2 were used in combination with 
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Gra£fi-1.4 MuLV LTR specific primers Ll and L2 (see also Figure 1). Cycling 
conditions were 1 min 94^0, 1 min 62 °C, 3 min 72°C for 30 cycles. For the 
second PGR, nested VIS-specific primers XIN and X2N (i.e. genomic region 
specific primers) were used in combination with nested LTR specific primers 
LIN and L2N under the same conditions. The obtained PGR products were 
analyzed by Southern blotting. To verify the correct nature of the amplified 
bands, the blots were hybridized with radiolabeled gene specific probes PI and 
P2 at 45^G in Church buffer (0.5 M phosphate buffer, pH 7.2, 7% (w/v) SDS, 10 
mM EDTA) overnight. Signals were visualized by autoradiography according 
to standard procedures (Maniatis et aL, 1982, Molecular Cloning, A Laboratory 
ManuaL Gold Spring Harbor Laboratory, Gold Spring Harbor, New York). 

Nucleotide sequence analysis 

PGR products were sequenced using an ABI 3100 sequencer (Applied 
Biosystems, Nieuwerkerk a/d IJssel, The Netherlands) with the Graf&-1.4 
MuLV specific forward primer Ll. Virus flanking genomic sequences were 
analyzed using GenBank (National Center for Biotechnology Information), 
Celera Discovery System (Gelera Genomics, Rockville, MD, USA) (Hogenesch 
et al., 2001, GeU 106:413-6; Kerlavage et aL, 2002, Nucleic Acids Res. 30:129- 
36) and Ensembl (Wellcome Trust Genome Campus, Hinxton, 
Cambridgeshire, UK) (Hubbard et al, 2002, Nucleic Acids Res. 30:38-41). 

Results 

Gra£fi-1.4-induced leukemia 

Eighty-nine newborn FVB/N mice were inoculated with GraflEi-1.4. When 
moribund, mice were sacrificed and hematopoietic organs were isolated. Six 
mice died without signs of leukemia and were excluded from further 
investigation. Standard blood cell analysis was performed and values 
compared with the mean of 10 normal FVB/N mice (Joosten et al, 2002, Exp. 
Hematol. 30:142-9). Most of the leukemic mice had increased numbers of 
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peripheral white blood cells and decreased numbers of platelets and red blood 
cells compared to normal controls (data not shown). Blast percentages in the 
bone marrow ranged from 24 to 90% with an average of 48%. Leukemic cells 
from 76 mice were immimophenotyped. The major immunophenotypic features 
5 of these leukemias are given in Table 2. 



Table 2 





Leukemia type 


Immunophenotype ' 


No. of 
leukemias 


I 


T-lymphoid 

Markers 


MP21^ CDS'", Thyl'' 


2 


n 


Mixed lymphoid, 
erythroid, myeloid 
differentiation 
markers 


Grl*, F4/80*, Macl*, Imm"'(a), CDS"", 
B220*, (gcsfr*)(h), (Terll9"^ 


12 


• m 


Myeloid 
differentiation 
Markers 


Imm*, MP21^ (ft/SO^.Grl"", B220\ Mac- 


43 


IV 


myelo-monocytic 
blasts 


Inim"", GTWgcsJr, (F4/80'^,(Macl+), 
(6220"^ 


15 


V 


Erythroid 


Terll9^ MP2r (Scan 


4 


Total 






76 



a(Imm"*") indicates positive staining for immature hematopoietic cell markers 
Scal+, MP58+, MP54+ (Thyl+). ^Markers between brackets are not 
10 consistently expressed on all individual tumors. 

Based on these criteria, the leukemias were classified as T-lymphoid, 
mixed lymphoid/erythroid/myeloid, myeloid, myelo-monocytic or erjrthroid. 
Fifty-nine lexikemias were analyzed morphologically. Almost all of the mice 
15 showed splenomegaly of which 25% thymus enlargement, 20% with lymph 

node enlargement and 55% Uver involvement. In 5% of the mice, leukemic cells 
were present in the BM and blood, without overt peripheral organ 
involvement. 



20 Virus integration sites in 6r-1.4 induced leukemias 
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PGR analysis in conjunction with database searches identified a total of 
94 different virus integration sites out of 69 tumors, 38 of which had been 
found previously in other studies. Examples of this latter group of CIS are p53, 
Notch-1, Evi-1, NFl CEvi-2), Lck-1, Pim-1, HoxAB (Evi-6), Fli-1 and N-Myc. 
5 Notably, 79 of the 94 integrations are CIS directly implicating the affected 
genes in leukemic transformation. The remaining 15 have been included in 
this report because closely related fam ily members of these genes were found 
in other studies. Fifty-six of the identified sequences were mapped near or in 
novel candidate leiikemia genes. The products of the affected genes have been 

10 classified in the following categories: receptors and signaling molecules (Table 
3), regulators of transcription (Table 4) and the remaining group of gene 
products with regulatory roles in other pathways (e.g., DNA stability and 
proteasomal targeting) or with currently unknown functions (Table 5). 

Approximately 15% of the virus integrations identified in this inverse 

15 PCR-based screen were determined to be common (more than 2 integrations in 
a particular genomic locus) in the initial screen. To determine the frequency of 
common integrations more sensitively, we performed integration specific PGR 
reactions on 49 Gr-1.4 target genes (tables 3-5) as described elsewhere 
(Erkeland, et ah,, 2003, Blood 101:1111-7; Joosten et al, 2002, Oncogene 

20 21:7247-55) of which most (96%) appeared to be common. The genes flanking 
the virus integration sites were compared with data fi:om the National Cancer 
Institute retroviral tagged cancer gene database (RTCGD) at 
http://genome2.ncifcrf.gov and other sources (Table 3). The Gr-1.4 integrations 
that are present in this database or were reported in other studies have been 

25 marked with a reference in Tables 3, 4 and 5. Family members of Gr-1.4 
targeted genes found in other studies are hsted in the column "identified 
family member". Most of the virus integrations occurred in or near 5' or 3* ends 
of the genes, suggesting that expression levels of these genes are deregulated 
as a consequence of virus integration. Eighteen CIS were exclusively located 

30 within the gene, conceivably causing gene disruption. This group comprises the 
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previously reported virus targets Notch-1, NFl, PTPNl, Inpp4a, p53, SWAP70 
and KcnkS and 11 new genes: calcyphosine, phosphodiesterase-1, ELL, NCOR- 
1, HDAC-7A, histone H3.3A, api-5, RIL, D6Min5e and two unknown genes. 

5 Discussion 

In this study, we have employed in vivo retroviral mutagenesis using 
the Graffi-1.4 (Gr-1.4) virus complex to identify novel routes for the 
pathogenesis of acute myeloid leukemia (AML) by identifying the disease 
genes specifically involved therein. In comparable studies with other virus 

10 strains, e.g., Moloney, AKXD or Cas-Br-M, the most prominently appearing 
tumor types are T and B cell Ijonphomas and in case of BXH2 myelomonocytic 
tumors. In contrast, more than 80% of the leukemias iaduced by Gr-1.4 
imequivocally exhibited immunophenotypic characteristics of myeloid cells, 
with immatxure morphological features (mainly myeloblasts), emphasizing the 

15 unique features of the Gr-1.4 virus complex as a tool for characterizing 
pathogenetic mechanisms in myeloid disease (see table 2). 

Indeed, the majority of the CIS described here have thus far not been 
reported in the extensive screens in the lymphoma models, although some 
overlap was observed. The latter is not svirprising in view of the fact that ceU 

20 type specific events usually impinge on downstream common regulatory 
pathways that can be affected in multiple tumor types. 

Although most of the Gr-1.4 CIS have been linked to candidate disease 
genes based on their proximity to these genes, proviral integrations have been 
reported to influence gene expression over distances of more than 100 kb. 

25 Thus, it cannot be excluded that genes located more distantly firom the CIS are 
also deregulated and that multiple genes are affected by a single CIS. In 
addition, three other aspects of the retroviral screens as they are currently 
being performed must be emphasized. First, malignancies induced by 
replication competent retroviruses are usually oligo- or polyclonal rather than 

30 monoclonal. Although this gives rise to high firequencies of CIS in relatively 
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small cohorts of mice, it complicates the search for cooperating events within 
one leukemic clone. Second, the sensitive PCR-based techniques used to 
identify virus integrations do not allow distinction between CIS present in a 
majority of the leukemic cells, initiating an early pathogenic event, and CIS 
5 present in only a minor population of cells, probably affecting leukemia 
progression genes. Finally, a recent study has emphasized that murine 
leukemia viruses have a preference for integration near transcriptional start 
sites (Wu, et al., 2003, Science 300:1749-51). Our present data with Gr-1.4 
corroborate this conclusion and indicate that retroviral mutagenesis with 

10 MuLV preferentially, although not exclusively, identifies gene deregulation 
rather than gene disruption. 

In conclusion, using high-throughput retroviral screens with the Gr- 
1.4 virus complex we have identified novel pathways involved in myeloid 
leukemia. Currently, we are studying the consequences . of aberrant gene 

15 expression or function employing gene transfer methodology in 32D cells and 
primary bone marrow cultures. 

Conclusion 

Acute myeloid leukemia (AML) is a heterogeneous group of diseases, 
20 in which chromosomal aberrations, small insertions/deletions, or point 
mutations in certain genes have profoimd consequences for prognosis. 
However, the majority of AML patients present without currently known 
genetic defects. Retroviral insertion mutagenesis in mice has become a 
powerful tool to identify new disease genes involved in the pathogenesis of 
25 leukemia and lymphoma. Here we have used the Graf&-1.4 murine leukemia 
virus (Gr-1.4) strain, which causes predominantly acute myeloid leukemias, in 
a screen to identify novel genes involved ia the pathogenesis of this disease. 
We report 79 candidate disease genes in common virus integration sites (CIS) 
and 15 genes of which family members were previously foxmd to be affected in 
30 other studies. The majority of the identified sequences (60%) have not been 
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foxind in previous screens in lymphomas and monocytic lenkemias, suggesting 
a specific involvement in AMXi. Although most of the virus integrations 
occurred in or near the 5* or 3* ends of the genes, suggesting deregulation of 
gene expression as a consequence of virus integration, 18 CIS were exclusively 
5 located within the gene, conceivably causing gene disruption. 

EXAMPLES 

Large scale identification of novel potential disease loci in mouse lexikemia 
MATERIAL AND METHODS 
10 Cas-Br-M MuLV-induced leukemias 

The NIH-Swiss derived Cas-Br-M MuLV-induced primary txunors used 
in this study are: for IPCR: CSL 13, 16, 20, 22, 26, 30, 31, 32, 33, 65, 71, 78, 82, 
90, 91, 93, 111, 117, 123, 201, 203, 204, 212, 221, 227, 228, 237 and 239; and 
for RT-PCR: CSL: 201 and 203 (Joosten et aL 2000. Virology 268:308-18). Cells 

15 isolated from leukemic spleens were stored vitally in ahquots in liquid 

nitrogen. The following leukemia cell lines were utilized in this study: for 
IPCR: DA 24, and NFS 22, 36, 56, 58, 60, 61, 78, and 124; and for RT-PCR: DA 
1, 2, 3, 8, 33, and NFS 22, 36, 58, 61, 78, 107 and 124 (Valk et al, 1997 Blood 
90:1448-1457). These cell lines were cultured in RPMI plus 10% fetal calves' 

20 serum (FCS) (Gibco Life Technologies Inc., Gent, Belgium) and 10 Units of 
mouse IL3. The frequency of virus integrations was determined both on cell 
Hues, and primary leukemias. 

IPCR and RT-PCR 

25 Isolation of genomic DNA was carried out exactly as described 

previously (Valk et al, 1999 J. ViroL 73:3595-602). Five pg of genomic DNA 
was digested with SauSA, Pvull or Sstl (GIBCO Life Technologies Inc., Gent, 
Belgium). The products were treated with T4-hgase (GIBCO Life Technologies 
Inc.), which resulted in the formation of circularized products. Subsequently 

30 we performed an IPCR strategy using primers specific for the Cas-Br-M MuLV 



wo 2004/016317 



PCT/NL2003/G00583 



74 



LTR. For the Sau3A digestedy%ated jGcagments, the first PGR reaction was 
carried out with primers pLTR4 (5'-CCGAAACTGCTTACCACA-3') (SEQ ID 
NO. 7) and pLTR3 (5'-GGTCTCCTCAGAGTGATT-3") (SEQ ID NO. 8), foUowed 
by a nested PGR using pLTR5 (5'-ACCACAGATATCCTGTTT-3') (SEQ ID NO. 
5 9) and pLTR6 (5'-GTGATTGACTACCCGCTT-3') (SEQ ID NO. 10). Cycle 

conditions for both PCRs were 15 s at 94''C, 30 s at 57''C, and 2 miin at 72'*C for 
10 cycles, and an additional 20 cycles following the conditions 15 s at 94'*C, 30 
8 at 57''C, and 2 min 30 s at 72**C. Reactions were performed using E^qpand 
High Fidelity PGR System (Roche, Mannheim, Germany). For PvuII and SstI 

10 digested genomic DNA, the circularized DNA was amplified using primers 

pLTR7 (5'-GACTCAGTCTATCGGAGGAC-3') (SEQ ID NO. 11) andpLTRl (5'- 
CTTGCTGTTGCATCCGACTGG-3') (SEQ ID NO. 12), and pLTRS (5'- 
GTGAGGGGTTGTGTGCTC-3') (SEQ ID NO. 13) and pLTR2 (5'- 
GTCTCGCTGTTGCTTGGGAGG-3') (SEQ ID NO. 14) respectively. 

15 The first PGR was performed for 30 cycles 30 s at 94'»G, 1 min at 54°G, 

and 3 min at 72'*G. Reactions were carried out with Expand High Fidelity PGR 
system (Roche). Nested PGR conditions wore 30 cycles of 30 s at 94^C, 1 Tnin at 
58**C, and 1 min at 72^C. This reaction was performed with Taq polymerase 
(Amersham Pharmacia Biotech, Roosendaal, The Netherlands). RT-PCR was 

20 carried out as described previously by Valk et ah (1997c). Briefly, total RNA 
was isolated through a CsCl gradient. First strand cDNA was obtained by 
reverse trsuiscriptase (RT) reactions with an oligo(dT) -adapter primer (5*- 
GTCGCGAATTCGTCGACGCG(dT)i5-3') (SEQ ID NO. 15) at 37°C with 5 ^ig 
RNA, using the Superscript M Preamphfication System (Life Technologies, 

25 Breda, The Netherlands) according to the instructions of the mantifacturer. 
Subsequently, PCRs (1 min at 94°C, 1 min at 58°C, 3 min at 72°C (30 cycles)) 
were performed on the RT reactions of the leukemias by using the LTR specific 
primer pLTR6 and the adapter primer (5'-GTCGCGAATTCGTCGACGCG"3') 
(SEQ ID NO. 15). PGR products were directly cloned into pCR2.1 (Invitrogen, 

30 Breda, The Netherlands) according to the instructions of the manufacturer and 
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subjected to nucleotide sequencing. Nucleotide sequences were compared to the 
NCBI database for analysis. 

PGR analysis and Southern blot hybridization 
5 One ug of genomic DNA isolated from primary leukemias or cell lines 

was subjected to PGR with primer pLTRl and a locus specific primer A (Figure 
2). Locus speci&c primers of 17-21 nucleotides were designed speci&c for each 
of the sequences of the fragments generated by Inverse- or RT-PGR. Primers 
were purchased from Life Technologies. One fil of PGR product was transferred 

10 to a nested PGR reaction using primer pLTR2 and a nested locus specific 
primer B. Cycle conditions for both reactions were 1 cycle 5 min at 94°C, 30 
cycles 30 s at 94^G, 1 min Tm, 1 min 30 s at 72°G, 1 cycle 5 min at 72**C. Tm 
was specific for each primer and was between 48'*G and 62''G. PGR was carried 
out using Taq polymerase (Amersham Pharmacia Biotech). PGR products were 

15 electrophoresed in a 1.5% agarose gel and transferred to Hybond-N+ nylon 
membranes (Amersham Pharmacia Biotech) with 0.25 M NaOH/1.5 M NaGL 
Membranes were hybridized with a ^zp.end-labeled locus specific primer G. 
Labeling was carried out using T4Mnase (USB, Gleveland, OH, USA) according 
to the instructions of the provider. Subsequently, blots were stripped in 0,4 M 

20 NaOH for 30 min at 45^0, neutrahzed using 0.2 M Tris-HGl, 0.2% SDS, and 
0.1 X SSG for 15 min at 45^0 and hybridized with ^zp-end-labeled Gas-Br-M 
MuLV specific primer pLTRS. Blots were e3q)osed for autoradiography with a 
KODAK film and an intensifying screen. After 15 min of exposure films were 
developed and analysed. 

25 

Sequencing analysis 

Samples were prepared using the Bd-sequencing kit according to 
instructions from the provider (PE Biosystems, Nieuwerkerk a/d IJssel, The 
Netherlands) and nucleotide sequencing was carried out on an ABI 310 
30 automatic sequencer (PE Biosystems) using primers M13 forward (5'- 
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GTAAAACGACGGCCAGT-30 (SEQ ID NO. 24) and M13 reverse (5'- 
GGAAACAGCTATGACCATG-30 (SEQ ID NO. 25). Sequences isolated by 
IPCR or RT-PCR were compared to the data present in the Celera Discovery 
System (Celera Genomics, RockviUe, MD, USA, database contents May 2002) 
5 and in the GenBank of the National Center for Biotechnology Information 

(NCBI, Bethesda, MD, USA). The exact site of integration was determined. For 
insertions found outside a gene, the distance between the integration and the 
most nearby gene was calculated. The location on the mouse chromosome was 
established and the human equivalent was deduced by using the human 
10 databases of Celera Discovery System (May 2002), LocusLink or human mouse 
homology maps (both www.ncbi.nlm.nih.gov). 

Results 

Applying virus LTR-specific inverse-PCR and RT-PCR combined 

15 with automated sequencing on CasBR-M MuLV induced myeloid leiikemias, 
126 virus integration sites were cloned. Using locus- and LTR-specific primers, 
a nested PCR/Southern-blotting procedure was developed on genomic DNA 
from a large panel of MuLV-induced leukemias, to analyse whether a 
particular virus insertion represented a common virus integration site (cVIS). 

20 In fact 39 out of 41 integrations analysed this way represented a common viral 
integration site. We recognized six previously cloned cVISs, i.e. Evil, HoxaT, 
cMyb, Cb2/Evill, Evil2, and Hisl and 33 novel common insertions. Among 
this group we found integrations in or near genes encoding nuclear proteins, 
e.g. Dnmt-2, Nm23'M2, Ctbpl or Erg, within receptor genes, e.g. Cb2 or mrcl, 

25 novel putative signaling or transporter genes, the ringfinger-protein gene 
Midi and a panel of genes encoding novel proteins with unknown function. 
The finding that 39 out of 41 integrations analysed represented a cVIS, 
suggests that the majority of the other virus insertions that were not yet 
analysed by the PCR/Southern-blotting method are located in e cVIS as well 

30 and may therefore also harbor novel disease genes. 
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Histone-lysine N- 
methyltransferase, H4 lysme- 
20 specific (Histone H4-K20 
methyltransferase) (H4-K20- 
HMTase) (SET domain- 
containing protein 8) 
(PR/SET domain-containing 
protem 07) (PR/SET07) (PR- 
Set7) [Rattus norvegicus]; 
SET domain-containing 
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PR/SET domain containing 
protein 07 (SET07) 
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DPM3 dolichyl-phosphate 
mannosyltransferase 
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prostin I; dolichol-phosphate 
mannose synthase subunit 3; 
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mannosyltransferase subunit 
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dohcholsynthase subunit 3, 
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related C3HC4 RING domain 
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